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ABSTRACT 

The problem discussed in this dissertation is the development of an efficient 
method for visual navigation of autonomous vehicles. The approach is to signifi¬ 
cantly reduce the expensive computational time of landmark detection by straight¬ 
edge features. A novel, fast straight-edge-detection method for use in autonomous 
vehicle navigation and other image-understanding applications is presented. Straight 
edges in gray-scale images are detected using a new direction-controlled edge tracking 
method, which gives precise estimate of the endpoints. To significantly reduce the 
number of exhaustive pixel computations, a random-hitting method using a pseudo¬ 
random number generator is proposed. Only if a generated pixel is significant do 
we start tracking the edge containing that pixel. To overcome the “noisy” gradient 
direction information, a robust least-squares linear fitting method is used to control 
the tracking process. 

The results of the algorithm show how it is efficient for landmark detection, 
which is important for motion control of autonomous vehicles. Thus the new method 
is implemented as a component of the image-understanding system in the autonomous 
mobile robot Yamabico-11 at the Naval Postgraduate School. 

An efficient world-modeling method based on the 2D model of the environment 
of the vehicle, including the heights of vertical edges in the environment, is presented. 

This modeling method is implemented with the new edge-detection method to im¬ 
prove the efficiency of the pose-determination algorithm (pose is a combination of 
the position and orientation of the camera), which is an essential task in the area of 
autonomous vehicle navigation. 
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I. 


INTRODUCTION 


A. BACKGROUND 

Computer vision is a field of science which deals mainly with the representa¬ 
tion, extraction, and manipulation of objects and information from digital images. 
The goal is to devise a way for a computer to interpret images in a useful way. Nat¬ 
urally, what is useful will depend on the application at hand [Ref. 1]. A computer 
vision system is considered the enterprise of automating and integrating a wide range 
of processes and representations used for vision perception. It can be viewed in terms 
of the following main functional components [Ref. 2, 3]: 

1. Image acquisition. 

2. Image processing (transformation, encoding, compression, and transition of 
images). 

3. Image understanding (image analysis, object detection, and pattern classifica¬ 
tion). 

4. Geometrical modeling. 

5. Output or display. 

Image acquisition transforms the visual image of a physical object and its in¬ 
trinsic characteristics into a set of digitized data which can be used by the processing 
unit of the system. This tcisk can be considered as comprising illumination, image 
formation or focusing, sensing, and formatting camera output signal. Image process¬ 
ing is concerned with transforming an image in one storage area to a new (processed) 
image in another storage area. Some basic image-processing operations include fea¬ 
ture enhancement, inversion, brightening, noise suppression, and compression. Image 
understanding mainly deals with the analysis of images for the purpose of extracting 
useful feature information. A key element in the task of locating features is that of 
edge detection. Identifying the locations of edges is essential for object recogni¬ 
tion. Geometric modeling^ a part of most of computer vision systems, describes the 
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structures of objects related to the application domain for which the computer vision 
system was designed. Output capabilities are provided by most of the vision systems 
to assist the users in making process management decisions and verifying the results. 

Over the past three decades, computer vision has evolved into many applica¬ 
tion domains. In each domain, a set of objects, tasks required, and knowledge sources 
should be specified. In the aerial images domain^ examples of objects are terrain and 
buildings. The tasks required include resource analysis, weather prediction, spying, 
missile guidance, and tactical analysis. The knowledge sources for that domain are, 
for example, maps and geometrical models of shapes. In the robotics domain^ the 
three-dimensional indoor or outdoor scenes and mechanical parts are examples of the 
objects related to this domain. Tasks required in such a domain include the iden¬ 
tification of objects in the scene for obstacle avoidance, self-localization, and parts 
assembly. For the medical domain, the tasks are mainly diagnosis of abnormalities 
based on objects, such as body organs, and the design of anatomical models [Ref. 3]. 

B. IMAGE UNDERSTANDING 

The ultimate goal in several computer vision applications is the extraction of 
useful information from the image data. The description, interpretation, and under¬ 
standing of the features of an image scene altogether constitute image understanding. 
For example, a computer vision system with an image understanding capability used 
in industrial applications (such as assembly lines) should distinguish parts and list 
their features (as in size and number of holes). Moreover, the system can observe the 
parts, determine whether they are within specifications, and generate command sig¬ 
nals according to the determined result. For a robot vision system, the system should 
help the robot recognize and interpret various objects and their spatial relationships 
in a scene. Motion control and execution should be performed through visual feed¬ 
back from the vision system. Image understanding basically involves the study of 
feature extraction, segmentation, classification, and interpretation. A method used 
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for both feature extraction and segmentation is edge detection. It is a fundamental 
problem in image understanding, and the simplest way to identify objects and obtain 
most of the relevant information about them such as shapes, locations, and distances 
from the camera. In this dissertation, we consider finding an efficient straight-edge 
detection method as a basis for autonomous vehicle visual navigation. Background 
information on edge detection will be presented in this chapter, and the principle 
of gradient-based edge detection will be presented in the next chapter, followed by 
the methods used. A typical image-understanding system is shown in Figure 1. The 



Figure 1. An image-understanding system. 


input image is first preprocessed (for instance, by enhancement or proper representa¬ 
tion of the data) [Ref. 4]. The image is then analyzed using segmentation and feature 
extraction. The results are fed into a classifier or an interpreter in order to obtain 
useful information, such as the relationship between different objects in the scene, 
and to let the system take the proper action. 

1. Feature Extraction 

In the process of feature extraction, it is necessary to extract certain features 
of objects from the scene. In an indoor environment image, as shown in Figure 2 for 
instance, the system should be able to extract specific features of the indoor struc¬ 
tures, such as the door locations in the hallway. These features serve as landmarks 
for visual navigation tasks. By way of comparison, in an X-ray image, the gray-level 
amplitude represents the absorption characteristics of the body masses which enables 
the discrimination of bones from tissues, or of healthy tissues from diseased tissues 
[Ref. 4]. 
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Figure 2. An indoor environment image. 


2. Segmentation 

The segmentation technique is required by a vision system in order to isolate 
the objects from the background. Segmentation is often imagined as the splitting of 
the image into a number of regions, each having a high level of uniformity in some 
designated parameter such as brightness, color, or texture [Ref. 5]. One method of 
segmentation is the region-growing technique [Ref. 6]. The goal of region-growing 
is to use image characteristics to map individual pixels in an input image to sets of 
pixels, called regions [Ref. 3]. In this technique, pixels are placed in a region on the 
basis of their similar intensity. Similar adjacent regions are then merged sequentially 
to form larger regions [Ref. 4]. This technique is found to be useful with the aid of 
edge detection. However, the technique tends to be quite computationally intensive 
[Ref. 5]. Another method of segmentation is that of template matching where an 
image is matched against templates from a given list in order to locate objects [Ref. 
4]. Another method of segmentation is thresholding the image at a particular intensity 
level. This method results in setting the objects as black figures on a white background 
according to binary images [Ref. 5]. 
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3. Classification and Interpretation 

These tasks of image understanding are concerned with the ability of the vision 
system to interpret the obtained information from segmentation or feature extraction. 
This is done to provide a description of the objects in an image scene in a useful way. 
Several techniques are used in these tasks, including pattern recognition, supervised 
classification, statistical classification, clustering, decision trees, and similarity mea¬ 
sures [Ref. 4]. An important method of interpretation by an image understanding 
system for autonomous mobile robots is that of matching between line segments 
(edges) of objects in an image and those of an indoor structure. 

C. AUTONOMOUS VEHICLES AND VISION 

Among the real-world applications of Artificial Intelligence are those of the 
autonomous mobile robot vehicles. They are defined as those vehicles that are capable 
of intelligent motion and action without requiring further human intervention [Ref. 
8]. Many individual components and research fields can be integrated for autonomous 
vehicle development. Such components include motion planning (for example in [Ref. 
9, 10, 11, 12]), image understanding (as in [Ref. 13, 14]), control (some examples 
are in [Ref. 15, 16]), kinematics [Ref. 17], sonar technology [Ref. 18, 19] , electronic 
circuits, system architecture, programming, neural networks [Ref. 20], and fuzzy logic 
and control [Ref. 21]. 

Research in the area of autonomous vehicles is of major practical interest. 
There are several potential applications of autonomous vehicles, such as manufactur¬ 
ing, construction, waste management, space and undersea exploration, assistance for 
the disabled, intelligent wheel-chairs for the handicapped, medical surgery, remote 
repair and maintenance, military operations, and material-handling system for offices 
and factories [Ref. 8, 22]. 

In military applications, emphasis is placed on using autonomous vehicles for 
handling a myriad of hazardous duty assignments [Ref. 9] like mine searching, UXO 
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clearing, reconnaissance, and fire fighting. A semi-autonomous vehicle called Shepherd 
is under development at the Naval Postgraduate School for research in mine detection 
and UXO clearing. 

Many of the above tasks require image understanding as an essential compo¬ 
nent, especially for visual navigation and guidance control of the vehicles in either 
outdoor or indoor environments. We define the visual navigation of an autonomous 
vehicle as “the ability of an autonomous vehicle to perform efficiently its global and 
local motion planning in a partially known environment under the guidance of a vi¬ 
sion sensor”. The visual navigation of autonomous vehicles in an indoor environment 
involves several aspects. Each can be considered as a stand-alone problem. Im¬ 
age analysis, model interpretation, pose estimation, object recognition, and obstacle 
avoidance are examples of these problems. 

1. Vision-Guided Navigation 

Vision-guided navigation was the focus of a number of researchers. Lebegue 
and Aggarwal [Ref. 23] developed an algorithm for automatically constructing CAD 
models of a structured scene as imaged by a single camera on a mobile robot. The 
scene to be modeled is assumed to be composed mostly of linear edges with particular 
orientations in 3D. Bishay, Peters and Kawamura [Ref. 14] developed a system that 
can detect stationary objects in an indoor scene, using log-polar mapping for the 
purpose of building a map for the environment to guide the robot from one place 
to another. This system was developed so that the size of the log-polar map is 
64 X 64 pixels. This is not suitable for the high-resolution images currently being 
used. Buger and Bhamus [Ref. 24] investigated the problem of estimating the camera 
motion parameters from a 2D image sequence that has been obtained under combined 
3D camera translation and rotation. They solved the problem using the concept of 
“Focus Of Expansion (FOE) feasibility”. Taylor and Kriegman presented in [Ref. 25] 
an algorithm that would enable a mobile robot equipped with a visual recognition 
system to carry out a systematic exploration of an unfamiliar environment in search of 
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one or more recognizable targets. Huttenlocher, Leventon, and Rucklidge presented 
in [Ref. 26] a method of using a sequence of monocular images for navigating a 
robot from an initial position to a specified landmark in its visual field. Lazanas and 
Latombe [Ref. 27] investigated the problem of developing navigation algorithms that 
rely on the ability of the robot to recognize and localize a specific set of landmarks at 
known positions in the environment. The problem of exploring an unknown polygonal 
room with a bounded number of polygonal obstacles was investigated in [Ref. 28] by 
Deng et al. Other applications of image understanding as an essential component have 
been recognized in many areas. For example, a mobile robot system is presented in 
[Ref. 29], on which a human can ride and share access to the environment through 
visual navigation. In [Ref. 30] a robotic system for guiding the blind on the road or 
sidewalk is presented. 

Some projects were dedicated for the research in mobile robot vision. The 
FINALE vision-guided mobile robot system, which navigates in an indoor environ¬ 
ment at a speed of 10 meters per minute in the presence of obstacles, was presented in 
[Ref. 13, 31]. This model-based system matched landmarks in the scene with features 
extracted from the images, to perform self-localization. However, it was limited to 
stop-and-start motion, since the robot had to be motionless to obtain an accurate 
video image. The NAVLAB [Ref. 7] is a commercial truck converted into a robot 
vehicle. It contains several on-board general-purpose computer workstations, as well 
as a WARP parallel architecture computer. Its sensors include color TV cameras and 
an ERIM laser range finder. Another approach to visual modeling and recognition is 
concerned with developing techniques for automatically building a representation of 
a complex physical environment (e.g., natural terrain populated with objects such as 
roads, bridges, bushes, and trees) based on data from imaging sensors and previously 
stored knowledge. UMass Mobile Perception Laboratory (MPL) [Ref. 32] is a simi¬ 
lar effort to build a landmark-based autonomous vehicle system composed of several 
independent processes, each solving a particular aspect of the navigation problem. 
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These are then integrated into a fully capable autonomous vehicle for both on-road 
and cross-country navigation. The integrated system was designed to “react” in real 
time to a changing environment and to “reason” about ways to achieve its goals. This 
autonomous vehicle performs pose determination in 30 seconds to 1 minute, which is 
very slow in real time. At 5 MPH an obstacle at 30 feet can be avoided if the obstacle 
is at least 2 feet high. 

The major problem with image understanding tasks in robotics is the expensive 
computation time required for image processing and then for image analysis. The 
computation time is very critical, since slow performance of image understanding 
algorithms leads to hazardous situations in real time. Developing efficient image 
understanding methods is one of the more challenging problems in both computer 
vision and robotics research. 

D. PROBLEM STATEMENTS 

The objective in this dissertation is to investigate a new efficient model-based 
image understanding method for an indoor autonomous vehicle, for both military and 
industrial applications, through a fast straight-edge detection algorithm. 

This work is part of the research project for the Naval Postgraduate School’s 
autonomous mobile robot vehicle Yamabico-11. Several previous efforts in motion 
planning [Ref. 9, 10, 11], automated cartography [Ref. 33], sonar-based obstacle 
avoidance [Ref. 34], and image understanding [Ref. 35, 36, 37, 38] have made con¬ 
tributions to the project. We expect this research to produce significant and efficient 
methods to integrate image understanding with previous efforts and the current soft¬ 
ware system on the robot. 

1. Problem Definition 

The problem addressed herein is: How can the model-based image understand¬ 
ing task for an autonomous vehicle be accelerated? The problem is decomposed into 
the following sub-problems: 
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1. How can the straight-edge detection, as a basic function in image understand¬ 
ing, be performed efficiently? 

2. How can the vehicle’s world environment be modeled so that its interpretation 
function within the image-understanding task is executed rapidly? 

3. How can the correct pose of the vehicle be found efficiently, given the con¬ 
structed model, the estimated pose, and the camera parameters? 

2. Assumptions 

The following assumptions and limitations were incorporated into the research 
in order to focus the work on the image-understanding problem; 

• A single CCD (Charge-Coupled Device) vision system is used. 

• The camera parameters (image size, focal length, physical size of CCD image 
plane) and field of view are already set and known. 

• The vehicle is operating in a partially known, orthogonal indoor world; walls, 
ceilings, door frames and floor meet at right angles. (These are perfectly suit¬ 
able for the current operating environment of the testbed vehicle Yamabico-11, 
which is the second floor of Spanagel Hall at the Naval Postgraduate School.) 

• The vehicle has dead reckoning capability. 

• The vehicle’s operating environment is bounded by a clockwise polygon (see 
Chapter VI). 

E. PREVIOUS WORK 
1. Edge Detection 

Many researchers have worked toward solving the straight edge detection prob¬ 
lem. The ability to solve this problem is required in robotics and in other areas. For 
instance, matching between straight edges in an image plane and those in a world 
model is especially important in model-based real-time robot navigation’and in object 
recognition. 

The approaches to the problem of edge detection vary from one application to 
another. Some are highly dependent on the knowledge of the images being processed. 
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Bolles [Ref. 39] described the idea of adjusting a boundary estimate by carrying lo¬ 
cal searches out at regular intervals along directions perpendicular to the estimated 
boundary. Another method was developed for some medical applications [Ref. 40] 
using divide-and-conquer for boundary detection. The Hough transformation for de¬ 
tecting lines and curves, introduced in [Ref. 41], is considered one of the standard 
techniques. Although this method by itself does not give the endpoints of the detected 
edges, Dudani and Luck [Ref. 42] included a least-squares fitting procedure into this 
method to obtain the endpoints of the detected line. Martelli proposed a heuristic 
search method to follow edges in images and, by doing so, converted the problem of 
edge finding into a graph searching problem [Ref. 43] using the A’ algorithm [Ref. 
44, 45]. Some applications of dynamic programming were described in [Ref. 3, 46]. 
The contour-following method can find regions for a given binary image even with¬ 
out any knowledge on the boundary shape [Ref. 47]. The method was adapted to 
gray-level images, as described in [Ref. 48]. The algorithm presented in [Ref. 49] 
requires scanning the whole image to obtain line-support regions and then determine 
the location and properties of the edges. Some other researchers followed this ap¬ 
proach with slight modifications [Ref. 50, 51]. Other methods also used the whole 
image scanning strategy with different edge detectors and modules [Ref. 52, 53]. A 
contour-tracing algorithm was proposed in [Ref. 54], based on a priori knowledge 
about the edges to be searched. Marr and Hildreth [Ref. 55] discussed a theory of 
edge detection using a two-dimensional Gaussian operator. Although this theory was 
highly influential in the following few years, it has the problem of expensive com¬ 
putational time since it requires preprocessing the images in order to study them at 
different scales [Ref. 5]. The computational approach to edge detection by J. Canny 
[Ref. 56] has emerged from this approach with emphasis on the optimality of the 
detector at any scale and on dealing with different signal-to-noise ratios in the image. 
He formulated the criteria desired in any edge detection operator as good detection, 
good localization, and only one response to a single edge. A variation of the Canny 
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operator is presented in [Ref. 57] to optimize the composite criteria using the calculus 
of variations. These approaches, and others discussed in [Ref. 5], are concerned with 
signal-to-noise ratios and the accuracy with which edge magnitude and orientation 
can be estimated. Davies and Johnstone [Ref. 58, 59] used the Sobel operator in the 
edge-detection task on enhanced images of their project, with slow execution time. 
A general discussion about using parallel processing and hardware implementation 
to speed the image processing tasks is found in [Ref. 5]. However, our approach in 
speeding up the edge detection is based on a single processor computer system. 

2. Pose Determination 

For an autonomous vehicle moving in a certain environment, some positional 
errors occur and accumulate. Thus, this problem of robot localization (pose deter¬ 
mination) has received the attention of several researchers in the mobile robot field. 
Some used model-sonar based navigation [Ref. 11, 60] to solve the problem. We will 
give a brief summary of previous efforts in the field of visual navigation. 

Fischler and Bolles [Ref. 61] solved for the lengths of rays from the optical cen¬ 
ter of the camera to the points in 3D space. The closed-form solution they presented 
is quite complex. Using triangle pairs and the Hough transform, Linnainmaa et al. 
solved the problem of finding the coordinates of the 3D points in camera coordinates 
[Ref. 62]. The approach used in [Ref. 63] was to decompose the solution into two 
stages, for rotation first and then for translation, with another method for simulta¬ 
neous rotation and translation. With the scanning method for edge detection and a 
complete 3D model, Peterson [Ref. 35] used Sugihara’s principle [Ref. 64] in the pose 
determination. Using a CCD video camera coupled with a conic reflector, Pegrad 
and Mouaddib [Ref. 65] obtained omni-directional views and presented a method for 
localization from natural landmarks obtained in the navigation area. Another ap¬ 
proach was taken in [Ref. 66] by converting a sequence of image measurements into 
a representation of the robot’s pose. Although this method does not require explicit 
environment modeling, it has the overhead of a collection of training examples, each 
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of which specifies the video image observed when the robot is at a particular location 
and orientation. 

F. ORGANIZATION OF DISSERTATION 

The remainder of this dissertation is organized as follows: Chapter II discusses 
the principles of image representation and edge detection as a basic function for 
several image understanding algorithms. Chapter III presents edge detection using 
the concept of gradient regions by image scanning. In Chapter IV, we present a 
novel and robust algorithm for edge detection using direction-controlled edge tracking. 
Chapter V describes a global method for detecting all important edges by introducing 
the concept of random hitting. In this method, finding an edge multiple times is 
eliminated. Chapter VI describes world modeling of the operating environment of an 
autonomous vehicle, and shows a simple method for modeling the environment by 
what is called a 2|D model. Chapter VII discusses how to use this model and the 
robust edge tracking method to perform pose determination. In Chapter VIII, we 
give a brief description of the hardware/software system of the autonomous mobile 
robot vehicle Yamabico-11 at the Naval Postgraduate School, with emphasis on the 
image system mounted on the robot. Chapter IX summarizes the contributions of 
this dissertation with recommendations for future work. 
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II. 


PRINCIPLE OF STRAIGHT-EDGE 
DETECTION USING GRADIENTS 

A. IMAGE REPRESENTATION 

An image function is a mathematical representation of an image. Generally, a 
gray-scale image T is represented in terms of the intensity (or brightness of the gray 
level) f{p) for each pixel p = (x, y). Such a gray-scale image is W pixels wide and H 
pixels high. An example of a gray-scale image is shown in Figure 3. Thus, an image 



Figure 3. A gray-scale image. 

is described by means of a. W x H matrix of nonnegative integer values /(x, y) that 
indicate the light intensity of the pixel with coordinates {x,y), as shown in Figure 4, 
where the origin (position (0,0)) of the matrix is at the left bottom corner. This 
makes the matrix correspond to the pixel coordinates in the image plane, although 
this may not be compatible with the known definition of matrices. The range of 
values assigned to the gray level of pixels in images usually depends on the number 
of bits to be used for representing the gray-scale values. Thus, the total number of 
levels in a gray-scale image is usually a power of 2. Most computer systems now 
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Figure 4. Matrix representation of image plane. 


support representation of eight bits for the gray-scale values. Thus, they range from 
0 to 2® — 1 (or 255). The value 0 is assigned for black, and 255 for white. For the 
6x6 image shown in Figure 5, the corresponding gray-scale pixel values are shown 
in the image matrix in Figure 6. 



Figure 5. A simple 6x6 gray-scale image. 
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Figure 6. Gray-level values of the 6x6 image. 
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For a 16 X 16 pixel window on an image of a polygonal object, as shown in 
Figure 7, the actual gray-level values of its pixels are shown in Figure 8. 



Figure 7. A 16 x 16 window on an image of a polygonal object. 


B. EDGE DETECTION PROBLEM 

The problem of fundamental importance in image analysis is the edge detection 
problem. Usually, edges in an image are boundaries of objects. Thus, edge detection 
is the simplest way to identify important contours and objects in the image. Edges 
are curves or lines in the image plane across which there is an abrupt change in 
brightness. By using the difference in gray level between image pixels, we can locate 
the boundaries of objects. By doing so, most of the relevant information about object 
shape, location, and distance from camera can be obtained. For instance, the image 
in Figure 5 has four edges. 

A gradient region is a small region in an image where light intensities are 
changing rapidly. It is defined as an intensity discontinuity in the image [Ref. 3]. 
The problem now is how to locate those regions in a given image, and then find 
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Figure 8. Gray-level values of the window pixels in the polygonal object image from 
Figure 7. 

their corresponding edges. In the following two sections, a solution to this problem 
is presented. 

C. GRADIENT DIRECTIONS AND GRADIENT RE¬ 
GIONS 

1. Gradient Computation 

A basic and common step in the process of detecting edges in a gray-scale 
image is to compute the gradient at each pixel. The partial derivatives and 
of the intensity function / of a pixel p with respect to x and y can be computed using 
a gradient operator. A gradient operator is represented by a pair of masks, and 
Aj,. Each mask is a square matrix of weights mapped onto a group of pixels around 
an origin (or center) pixel. The mask A^, will be used for computing the horizontal 
direction gradient, and Aj, for the vertical direction gradient. There are several edge 
operators that can be used in edge detection [Ref. 67]. Figure 9 show's three of 
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the well-known operators. In Figure 10, we show 3x3 pixel locations. The eight 
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Figure 9. Gradient operators. 


surrounding pixels about a center pixel at location p = (x, y) will be used to find the 
gradient information for pixel p. 


(a:-l,y + l) (x,y + l) (x + l,y + l) 
(x-l,y) {x,y) (a: + l,y) 

(x-l,y-l) (a;,y-l) (a: + l,y-l) 


Figure 10. Indices of 3 X 3 pixel set centered at pixel p = {x,y). 

The two gradient components gj;{p) = and gy{p) = can be computed 
using the Sobel operator by multiplying each of the weights corresponding to those 
pixels in S on each mask with its intensity value. 

9x{p) = -/(x-l,y-l)-2/(x-l,y)-/(x-l,y + l) 

+/(x + 1, y - 1) + 2/(x + 1, y) + /(x + l,y + 1), 

9y{p) = /(x-l,y + l)-|-2/(x,y + l) + /(x + l,y + l) 

-/(x - l,y - 1) - 2/(x,y - 1) - /(x -1- l,y - 1). 

Then the gradient magnitude g{p) is computed from these two orthogonal gradient 
components (Figure 11) as follows: 
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9{p) = \/{9x{p)f + {9y{p)f, 


when g{p) > 0, its gradient direction (j){p) is defined as 


<l){p) = atan2(^gy{p),g^{p)y 

The function atan2(gy{p),gi{py gives the direction in four quadrants [—7r,x]. It is 
more useful than the mathematical equation tan~^i^gy[p) jgx{p) ) because the latter 
has range —fjf ] defined when gx{p) = 0 [Ref. 68]. The gradient 

direction information of pixels is extremely valuable in the straight edge detection 
task, as shown later. 

A pixel p is said to be significant if its gradient magnitude g{p) is greater than 
some threshold value. The gradient image (the image with only pixels of significant 
gradient magnitudes) obtained from the original gray-scale image in Figure 3 is shown 
in Figure 12. 


2. Gradient Regions 

Let be the direction of a straight edge L in an image, and let p be a pixel in 
L (Figure 13). Then the gradient direction <f){p) must be approximately orthogonal 
to ij). The relation between the edge direction ^ and the gradient direction <^(p) is 
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Figure 12. A gradient image. 


defined by <j>{p) « ^ + -I or t/> — -I-Therefore, all pixels in one straight edge must have 
a similar gradient direction (j). We say “similar” because noise can affect the value of 
Through this observation, we can define a “gradient region”. A gradient region 
R in an image J is a set of pixels that satisfies the following conditions: 



Figure 13. Gradient and edge directions. 
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(1) Each pixel p \n R has a significant gradient magnitude ^(p). 

(2) Each pixel p m R has a similar gradient direction 

(3) All pixels in R are connected by eight-neighborhood (any two pixels in R must 


be adjacent within a 3 x 3 window and one of them is the center of this window). 


Figure 14 is a blowup of a part of Figure 12, including boundaries of the light 


switch beside the door lock. It shows four distinct gradient regions, and each pixel in 
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Figure 14. Four distinct gradient regions in the gradient image. 


a particular gradient region R is represented by a unique symbol. 
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D. FINDING EDGE FEATURES BY LEAST-SQUARES 
FITTING 

Assuming that we find an edge region, the next task is to find its linear features. 
The least-squares linear fitting algorithm [Ref. 69] is a very robust and helpful method 
for doing this. Although it was originally used to find line segments from sonar return 
data for mobile robot navigation, it can be used in image understanding as well. 
For a set R of pixels representing one gradient region, where R = {pi, ...,pn} and 
Pi = {xi,yi), we obtain the moments rrijk by 




t=i 


where 0 < j, /;: < 2 and j + k <2. Actually, moo = ^ 

The secondary moments Mij around the centroid C — {px, Py) where px = 

and Ptj — are 
' ^ moo 

n 

M20 = - pxf = m2o - 


m 


10 


t=\ 


~ ^^x){yi - = rail - 


moo 

miomoi 


i=\ 


moo 


^02 = - N? = "^02 - 


m 


01 


i=l 


moo 


A pixel p = (x, y) that satisfies 


X cos a + y sin a = r 


(ii.i) 


lies on a line L whose normal has an orientation a and whose distance from the origin 
is r (Figure 15). We represent this line L by (Q:,r). This representation has a striking 
advantage as opposed to the usual method of using a formula y — f{x), because 
the former method has no difficulty in expressing lines that are perpendicular to the 
X axis. The signed distance (or residual) S{ from a pixel p,- = {xi, yi) to the line 
L = (r, a) is 

6i = Xi cos a -t- yi sin a — r. 
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Figure 15. Set of pixels and their fitted line 


Therefore, the sum of the squares of all the residuals is 


S = ^cos a + yi sin a) — rj 


i=l 


Since the line which best fits the set of points is supposed to minimize 5, the optimum 


line (r, a) must satisfy 

dr da 

Thus, 


dr 


n 

= —2 ^ (^{xi cos a + yi sin a) — r^ 


i=l 

n 


= 2 I r 1 - cos a ^ - sin a ^ j/i I 

\ i=l z=:l t=l / 

= 2(r moo “ ^lo cos a — moi sin a) = 0, 


and 

mio moi . , . 

r =-cos a H-sm a = fix cos a + fiy sin ct, 

moo ^00 

where r may be negative. Substituting r in Eq. II. 1 by Eq. II.2, we obtain 


(II.2) 


as 

da 


2 - Hx) cosa + (yi - yy)sma'j (-(x, - fix) sina + (yi - fiy) cos a) 

i=l 
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Therefore, 


- ^i-yf - - ^^xf) sin a cos a 

Z=1 

n 

+ 2 ^(x, - fix){yi - /Xj)(cos^ a — sin^ a) 

i=l 

(A/o2 — ^ 2 o ) si'fi 2 q: T cos 2o; = 0. 


2q: — atan2(— M 02 — ^ 20 )- 

Note that, by Eq. II.3, the value of 2a can be in any quadrant, and so a G [—f’f]' 
Eqs. II.2 and II.3 are the solutions to the least-squares fitting problem. 



Figure 16. Two endpoints of a line segment L. 

Since the residual Si of a pixel pi = (x,-, yi) is given by 

Si = Xi cos a -|- yi sin a — r, 

the projection, p'^ of the point pi onto the fitted line is 

Pi = (x, — Si cos a, yi — Si sin a). (II-4) 

We will use pj and p^ as estimates of the endpoints of the line segment L obtained 
from the set of data points R (Figure 16) [Ref. 68]. 
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E. SUMMARY 

The principle of edge detection using gradient regions is described in this 
chapter. The principle is illustrated in Figure 17. The first image, the original one, 
has three areas with different intensities. There are three gradient regions separating 
them. The line segments corresponding to these gradient regions are then obtained 
using the least-squares fitting, as described in the previous section. This principle 
can be applied in several ways. In the next chapter we discuss the image-scanning 
method using this principle. 
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Figure 17. Principle of edge detection using gradient regions. 
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III. 


STRAIGHT-EDGE-FINDING METHOD 
BY SCANNING 

The principle of edge detection described in the previous chapter can be applied 
in different ways. In this chapter, one such method is presented. In this method, the 
whole image is scanned pixel-by-pixel to find those pixels that form gradient regions 
and hence the line segments that represent those gradient regions. Figure 18 shows the 
conceptual view of the operations performed to detect edges using this method. This 
method is different from that used in [Ref. 35] in the technique for forming gradient 
regions. The method includes a simpler approach and different data structures for 
that purpose, as we will describe. The process is divided into two levels: 

• Pixel-level operation: to decide whether each pixel is an “edge” pixel or not, 
and to decide in which region this pixel should be included. 

• Region-level operation: to find the line segment information for each gradient 
region. 

In this chapter, an overview of the method is presented, followed by a description of 
the algorithm and data structures. Finally, a sample of the experimental results is 
presented. 

A. IMAGE SCANNING 

The image is scanned starting from the pixel at (1,1), by excluding the pixels 
that exist at the image outer boundaries. For an image with size oiWxH pixels, 
the pixels on rows 0 and W — I are excluded, as well as those on columns 0 and 
H — I, because the Sobel edge operator we are using (see chapter II) requires eight 
surrounding pixels about the central pixel. The scanning is performed from left to 
right and from bottom to top, as shown in Figure 19. 
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Figure 18. Main operations for edge detection. 


B. CONNECTIVITY TEST 

During the scanning process, it is necessary to test whether the pixel has 
significant gradient magnitude g{p) or not. If so, its gradient direction (f>{p) will be 
computed. This pixel then becomes a candidate to be a part of some gradient region 
in the image. Finding this region for a specified pixel is the subject of this section. 
One characteristic of an edge region is the connectivity. The connectivity test of a pixel 
is intended to see if the pixel can be connected to some gradient region containing 
one of its neighbors. If the new scanned pixel p = (i, j), such that 1 < i < W — 2 
and 1 < j < H — I, then the connectivity test is done for the pixels at (^ — l,j), 
(i — 1,J — 1), {i,j — 1), and (i -f l,j — 1), which means that the test includes the 
pixel immediately to the left of the current pixel and the three nearest pixels in the 
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j/-axis 
of image 


next lower row. This situation is shown in Figure 20, where the current pixel at 
(z, j) is represented by a bullet and the pixels indicated by shaded squares are those 
considered in the connectivity test. However, there are some special cases that should 
be considered for the connectivity test. For pixels at locations with j = 1, the test 
will be done with only one pixel to the left. For pixels at locations with i = 1, the 
test will be done only with the two pixels (z, j — 1) and (z + 1, j — 1). For the pixels 
with (z* = VF — 2), such that 1 < j <// — !, the test will be done with only three 
pixels, (z - 1, j), (z - 1, j - 1), and {i,j - 1) (Figure 21). 

If one of the neighbors of the current pixel p belongs to some region and 
if the average gradient direction of R is close (l>{p)j then the pixel p should be added 
to that region i?, as shown in Figure 22. If it is found that p does not belong to 
an existing gradient region, a new region will be created and the new pixel will be 
considered the first one in the new region. This situation is shown in Figure 23. 

As we add a new pixel to an existing gradient region we maintain the 
region characteristics. Selecting one out of the neighboring pixels to join its region, if 
it belongs to any, is one of the most important tasks in this algorithm. The region of 
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Figure 19. Scanning direction. 
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Figure 20. General case of connectivity test. 



Figure 21. Special cases of connectivity test. 
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Figure 22. A pixel is included in the region as one of its neighbors if it satisfies the 
connectivity test. 
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Figure 23. Current pixel starts a new region if it cannot belong to a region as one of 
its neighbors. 

the chosen pixel should have the minimum difference in direction between its average 
gradient direction and that of the current pixel. 

C. DATA STRUCTURES 

We describe an image by a two-dimensional array of size W x H. This cor¬ 
responds to our usage of matrix representation of images, explained in the previous 
chapter. Each element represents the intensity value f{p) of pixel p = {x,y) with 
range of 0 to 255. The image pixel data structure is shown in Figure 24. 

Image Pixel 
• Intensity value 


Figure 24. An image pixel data structure. 


1. Row of Pixels 

Two one-dimensional arrays [0..W — 1] of the same structure are used for 
storing processed pixel information on the current row in the image (Current) and 
the previous row (Previous). Each element of the array is indexed by the r-coordinate 
of the pixel and contains the information shown in Figure 25. 
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Row Pixel 


• Gradient magnitude 

• Gradient direction 

• Pointer to gradient region 


Figure 25. Current or previous row pixel data structure. 


2. Gradient Regions 

Gradient regions are described as a list structure (Figure 26). Each element of 




Figure 26. List structure of gradient regions. 


the list corresponds to a gradient region and contains the moments and least-squares 
fitting parameters of the region and its first and last pixels. At the creation of a 
gradient region the parameters should be initialized, and they are updated when a 
new pixel is added to the region. From these parameters, the number of pixels, moo, 
is used to judge the strength of a gradient region. A region with a very small number 
of pixels is interpreted as some noise in the image. It is desirable to have strong 
regions to obtain major line segments corresponding to them. Another attribute is 
the average value (l)avg of the gradient directions of the pixels included in the region. 
This attribute is used in the comparison to verify the closeness between a pixel and 
a region. To obtain the line segment, the moments (mo],mio,^ii,^ 20 ,”^ 02 ) and 
centroid [nxiliy) are included in the structure. The first and last pixels {qi and 92 ) of 


32 
























the region should be known. They are used to compute the exact endpoints (ei and 
62 ) of the line segment. The region structure is represented by the parameters shown 
in Figure 27. 

Gradient Region 

• Num. of pixels f/WQQ) 

• Sum of x-coordinates ( w jq ) 

• Sum of y-coordinates ( ) 

• Sum of squares of x-coordinates ( m2Q ) 

• Sum of squares of y-coordinates ( /W02 ^ 

• Sum of products of x and y-coordinates ( ’^w) 

• x-value of centroid ( |X ^ ) 

• y-value of centroid ( p ^ ) 

• Starting pixel ( ) 

• End pixel ^^2 ) 

• Gradient directions average ( (p ) 

• Pointer to next region (Next) 

Figure 27. Gradient region data structure. 

3. Line Segment 

A detected line segment L is represented by the parameters shown in Figure 28. 
These include the two endpoints e\ and 62 , as well as the orientation a of the normal 
from the origin to L and the length r of the normal to L (See Chapter II). The 
number of pixels (moo) of its gradient region also is included. 
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Line Segment 

• Number of pixels (m qq ) 

• First endpoint { e^) 

• Second endpoint ( ^2 ^ 

• Length of its normal ( r ) 

• Orientation of the normal ( a) 


Figure 28. Line segment data structure. 

D. ALGORITHM 

In this section, we describe the pseudo-code of the major algorithms of the 
method. Specifically, the main algorithm, the method to compute the gradient in¬ 
formation at each pixel, the connectivity test, the least-squares fitting, and com¬ 
putation of the endpoints of the detected line segments will be discussed. The 
main algorithm is shown in Figure 29. The scanning is performed pixel-by-pixel 
(Line 1). The gradient magnitude g at pixel p is computed using the function 
ComputeGradientMagnitude(p, X) in Line 2. If the computed value is larger than 
some threshold value Gt (Line 3), then p is considered significant, and hence the 
gradient direction <f> will be computed in Line 4. Then the region R to which p will 
be included is returned as a result of performing the function ConnectivityTest(p, 
Current, Previous), in Line 5. In this function, we examine the neighboring pixels to 
p in the current and previous rows (as explained in Section B). One of two actions 
is performed based on the connectivity test. If a new region is created, it should be 
initialized with p as the first pixel (Lines 6-7). If p can be included in an already open 
region, we update the least-squares fitting parameters of the region and set the last 
pixel in R to be p. This is done through the function UpdateRegion(i?,p) in Line 
9. After the scanning process is completed, the next task is to scan through the list 
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DetectEdgesByScanning(J) 

begin 

I . for each image pixel p G X do 

2- 9{p) — ComputeGradientMagnitude(p, J, 

3. if g{p) > Gt then : 

4. <?i(p) = atan2(pj,,pj;) 

5. ConnectivityTest(p,Current,Previous,/?) 

6 . if NewRegion(/?) 

7. InitializeRegion(/?, p) 

8 . else 

9. UpdateRegion(/?, p) 

10. for each edge region R 

II . if {moo{R)>no) 

12. LeastSquaresFitting(/?, a, r) 

13. ComputeEndpoints( 9 i(i?), q 2 {R),C(, r, ej, 62 ) 

14. return {moo(/?),o;,r, 61 , 62 ) 

15. else 

16. return nil 
end 


Figure 29. The main algorithm for edge detection by scanning. 

of regions (Line 10). For any region that has a sufficient number of pixels (Line 11), 
we compute the segment parameters by the least-squares linear fitting, and the exact 
two endpoints (Lines 12-13) using the values of the first pixel in R, which we denote 
q\{R) and last pixel q 2 {R). 

1. Computing Gradient Magnitude 

The function ComputeGradientMagnitude is used to apply the Sobel op¬ 
erator at pixel p = {x^y) for computing its gradient magnitude. The Sobel edge 
detector provides good performance and is relatively insensitive to noise. It provides 
a balance between the computation time and the accuracy of edge direction [Ref. 5]. 
The algorithm is presented in Figure 30. We use the two dimensional array /[a:][j/] to 
represent the intensity values /(p) for p = {x,y). 
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ComputeGradientMagnitude(p, J, Qx, Qy) 
begin 

1. Qx = -I[x - l][j/ - 1] - 2 X /[a: - Ijfy] - I[x - l][j/ + 1] 

-\-I[x + l][t/ - 1] + 2 X /[x + l][j/] + /[x + l][j/ + 1] 

2. Qy = -I[x - 1][2/ - 1] - 2 X I[x][y - 1] - I[x + Ijfy - 1] 

+/[x - l][j/ + 1] + 2 X I[x][y + 1] + I[x + !][?/+ 1] 

3. return ^gl + gl 
end 


Figure 30. Computing gradient magnitude with Sobel operator. 

2. Connectivity Test Function 

The general algorithm of connectivity test is presented in Figure 31. In lines 
1-8, the regions Ri of the neighboring pixels are determined and the differences in 
gradient directions d, between <f>{p) and the average gradient direction of each of Ri 
are computed. We take a region with the minimum difference d,-, if this value of d,- is 
less than some angle threshold or start a new region (Lines 9-14). Finally, the region 
R to which the pixel p belongs is determined and returned (Line 15). 

3. Least-Squares Fitting 

The least-squares fitting is a robust method to compute the parameters of 
a line segment that best fits a set of points in a region R. The parameters of 
the region are updated when a new pixel p = (x,j/) is added to R. The function 
UpdateRegion(i?,p) performs this task. Now the values of r and a for the normal 
to the required line segment can be obtained. They are computed by the function 
LeastSquaresFitting, which is presented in Figure 32. Notice that the moments 
^00) ^205 a^nd mo 2 are among the parameters of region R. For simplicity, 

we use them without referring to R in the algorithm. 
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ConnectivityTest (p,Current,Previous,R) 
begin 

1. Ri = Region(Current[ 2 : — 1]) 

2. di = Normalize(|(?i>(p) — 4>avg{Ri)\) 

3. R 2 = Region (Previous [a: — 1]) 

4. c ?2 = Normalize(|<;/>(p) - <f>avg{R 2 )\) 

5. R 3 = Region(Previous[a:]) 

6. dz = Normalize(|^(p) — <?^at)s(R 3 )|) 

7. R 4 = Region(Previous[a: + 1]) 

8. d 4 = Normalize(|^(p) — <i)avg{RA)\) 

9 . (<i, i) = MinValueAndIndex(c?i, (^2) <^35 <^4) 

10. if d > d ; no closeness in gradient directions 

11. R = CreateRegion() 

12. NewRegion(R) = TRUE 

13. else 

14. R = Ri 

15. return R 
end 


Figure 31. General connectivity test algorithm. 


LeastSquaresFitting(/2, a, r) 


begin 


1 . 

2 . 

3. 

4. 


M20 = m 2 o - 

Mn = mil - 


Mo2 — mo2 — 


moo 


moo 


moo 

2 


a = 0.5 X atan2(—2Mii, 


Mo 2 — M20) 


5. r = Hx 'X cos(a:) fiy x sin(Q;) 

6. return (o, r) 


end 


Figure 32. Least-squares fitting of a region. 
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ComputeEndPoints(5i, 92, ct, r, ei, 62) 
begin 

1. 6\ = Xgi X cos(q) + ygi X sin(a) — r 

2. S 2 = Xg 2 X cos(q;) + yg 2 X sm(a) — r 

3. aji = Xgi — Si X co5(q;) 

4. yi = 2/,i - X 5 w(q:) 

5. X 2 = Xg2 — ^2 X cos(q;) 

6. ?/2 = J/92 -S2X sin(a) 

7 . return (ei = (xi, 7 /i),e 2 = (x2,y2)) 
end 


Figure 33. Endpoints computation algorithm. 

4. Computing Endpoints of Segments 

After computing the least-squares parameters (r, a), we compute the two end¬ 
points Cl = (xi,yi) and 62 = (x 2 , 2 / 2)5 using the first pixel qi = (x,i,y,i) and last 
pixel q 2 = ixg 2 ,yg 2 )- This is described in Figure 33. 

E. EXPERIMENTAL RESULTS 

The algorithm was tested on actual images taken by a CCD camera. The input 
images shown in Figures 34 and 36 are used to produce the line segments shown in 
Figures 35 and 37, respectively, using the edge detection algorithm described. 
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Figure 34. Input image of a printer. 



Figure 35. Line segments extracted from a printer image. 
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Figure 36. Input image of a hallway portion. 



Figure 37. Line segments detected form the hallway portion image. 
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IV. 


DIRECTION-CONTROLLED EDGE 
TRACKING METHOD 

A. PRINCIPLE 

In this chapter we present a new method of detecting an edge, called “Direction- 
Controlled Edge Tracking” [Ref. 70]. Given a significant pixel po in an image, the 
edge tracking task is to find a sequence Q of pixels on the edge containing po- This 
task is performed in two opposite directions and i/>_) that are approximately 
orthogonal to the gradient direction <f>{po), as shown in Figure 38. To illustrate the 



Figure 38. Edge tracking in two opposite directions. 

idea of the algorithm. Figure 39 shows the sequence of pixels obtained by tracking 
the slanted edge in the lower right portion of the image shown in Figure 36 in the 
previous chapter. The best-fit line segment is also shown. The best-fit line L to Q is 
computed through the least-squares fitting method to obtain (i) the normal direction 
a, and (ii) the distance r from the origin to the line L. However, finding the equation 
of this line is not enough for applications in image understanding which require com¬ 
puting the endpoints of the line segments. While tracking a pixel sequence Q, the two 
end pixels of this sequence, qi and q 2 , are logged. Using these two pixels as the best 
estimate of the edge endpoints is not recommended, because noise may affect their 
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Figure 39. Edge pixel sequence with its fitted line segment. 


positions. Instead, and q 2 are used to compute the two projected endpoints e-i and 
62 as explained later. We call a straight line L corresponding to a pixel sequence Q 
major when the number of pixels n contained in Q is more than a threshold value no, 
say 50. 

1. Edge Tracking Algorithm 

Given an image / and an initial pixel po, the algorithm returns a line segment 
L corresponding to an edge on the image (Figure 40). The overall algorithm for edge 
tracking and segment detection is described in Figure 41. The function “TrackPixels” 
in lines 2 and 4 is further elaborated in Figure 43. 
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Figure 40. Line segment detection by tracking. 


B. TRACKING CONTROL BY EDGE DIRECTION 

The edge tracking task is performed as a sequence of SelectNextPixel op¬ 
erations which select a neighboring pixel p 2 from the current pixel pi. In each Se¬ 
lectNextPixel operation, selecting the neighboring pixel with the maximum gradient 
magnitude is apparently a reasonable strategy, since this criterion minimizes a pos¬ 
sible noise effect. However, if an edge is tracked under this criterion only, a curved 
line or a sequence of connected line segments might be tracked. Such a result does 


ComputeSegment(po, T) 

begin 

I- Q = {po} 

2. Q = TrackPixels(po, Q, T, -1) 

3. qi = LastPixel((5) 

4. Q = TrackPixels(po, Q, T, —|) 

5. q 2 = LastPixel((5) 

6 . LeastSquaresFitting((5, or, r,) 

7. ComputeEndPoints(^i, q 2 , a, r, ei, 62 ) 

8 . if(|g|>no) 

9 . return (a, r, 61,62) 

10 . else 

11. return nil 
end 


Figure 41. Edge tracking and segment detection algorithm. 
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not meet the requirement of detecting straight edges separately. To avoid this uncon¬ 
trolled tracking, the direction V’ of the segment being tracked is computed using the 
partial Q obtained so far. This direction 0 is computed at each step and is used to 
control the SelectNextPixel process in the following way: (i) defines three neighbor 
pixels from which the next pixel will be selected, and (ii) a selected pixel is tested for 
the consistency between its gradient direction and i/’- 

1. Tracking Direction Evaluation 

To obtain robust orientation information V’ for Q, the least-squares fitting al¬ 
gorithm is applied to Q. By doing this, we obtain the normal direction a of the best 
fitted line for it (assuming |(5| > 2). Since the direction a is computed by the posi¬ 
tional information of all the pixels collected so far, this value is expected to contain 
extremely fine directional information. Notice that the range of a is [—f,f]. We 
observed in Figure 38 that all pixels in a given straight edge must have a similar gra¬ 
dient direction (/>, and this value should be close to a or tt — a. From this observation. 


y 



Figure 42. Finding tracking direction. 

we assume that the direction 'i/> of an expected edge is perpendicular to a. Therefore, 
we compute and V’- as 


V’+ = + f. 
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V>- 



This relation is shown in Figure 42. Obviously, ' 0 + and 0 _ are mutually apart by 
TT. The edge is first tracked in the direction of 0+ in the range of [ 0 , 7 r], and next it 
is tracked in the opposite direction 0 _ in the range of [—tt, 0 ] to complete the edge. 
However, in the initial state where Q = {po}, it is impossible to find a by least-squares 
fitting. In this case, the gradient direction 0(po) is used to evaluate 0±. Since the 
range of 0(po) is [—tt, tt], we need to normalize it into the interval of »f] to keep 
consistency in the evaluation of 0 ±. 


4 ’± = V(«i(po))±^. 

The normalization function M{')) for an angle 7 is defined as follows: 

✓ 

Ar(7-7r), if7>f 

-^( 7 ) = + tt), if 7 < -| 

^7, if - f < 7 < f 

For instance, = f. 


2. One-Way Pixel Sequence Tracking Algorithm 

Figure 43 shows the algorithm of tracking a pixel sequence Q starting with a 
pixel po in one way. As mentioned earlier, this procedure is performed twice, once 
in the direction 0 + (with /9 = ^) and next in the direction 0 _ (with ^ = —f), as 
shown in the main edge tracking algorithm (Figure 41). The moment calculation for 
the least-squares fitting task is included in lines 4 and 12. 


C. SELECTING NEXT PIXEL 

1. Quantization of Edge Direction 

As outlined in the previous subsection, the edge direction 0 (0+ or 0 _) is 
utilized in two ways: 

1 . Finding which specific pixels among eight neighbors should be examined for 
selecting a next pixel, and 
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TrackPixeIs(po, Q-,^-,1^) 
begin 

1. if 101 = 1 

2. a = M(4>(po)) 

3 . else 

4. LeastSquaresFitting((5, Of, r) 

5. Pi = po 

6 . doforever; 

7 . ij: = a + ^ 

8. p 2 = SelectNextPixel(pi, 

9 . if (p2 = nil) 

10 exit 

11 . Q = QV{p2} 

12. LeastSquaresFitting((5, ct, r) 

13 . Pi =P2 

14. return Q 
end 


Figure 43. Tracking one side of the pixel sequence. 


2. Testing whether a pixel among these neighbors has a gradient direction con¬ 
sistent with xjj. 


First, the domain [—tt, tt] of ^ is divided into eight sub-domains 


Vi = 


2z — 1 2t -|- 1 

TT, -X 


8 ’ 8 

For instance, V’ = f belongs to T>i = ’ 


8^’ 8^ 


for z = 0, • • •, 7. 


For each sub-domain X>,-, we assign 


a set Si of three neighboring pixels, as shown in Figure 44. For example, if the 
current pixel is pi = (x,y), and if V’ € T>i, the next pixel must be selected from 
Si = {(x,z/ -f 1), (x -b l,p -b l),(x -b 1,2/)}, as shown at the top right portion in 
Figure 44. The pixel (x -b 1,2/ + 1) is precisely located in the direction ^ from the 
pixel pi. The set <Si includes two more neighboring pixels of (x -b 1,2/ + !)• 


2. Next Pixel Selection 

Given the set S,, the selection of a next pixel p 2 is performed as follows: 
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Figure 44. Next pixel set for each range T>i. 


• We choose the most significant pixel p 2 in <5,-, and 

• Test whether its gradient direction 4>{p2) and the tracking direction ^ are 
consistent^ i.e., ^ 6 (^ 2 ) is approximately normal to V’- 


Definition: Two directions (f> and are considered to be consistent if 



for a small angle threshold e. 

If the most significant pixel passes this test, it is defined to be the next pixel p 2 - 
If this consistency condition is not satisfied for the most significant pixel, then we test 
whether there is another significant pixel in Si. If so, the consistency test is executed 
again for this pixel. If this is consistent, this becomes the next pixel p 2 . Otherwise, 
the test is executed for the last (third) pixel in In other words, the selection and 
consistency test for the remaining pixels in Si are repeated. Figure 46 shows two 
cases: (a) two directions xp and 4> are consistent, or (b) they are inconsistent. 

Suppose the next pixel p 2 is found. In this case, this new pixel will be added 
to the pixel sequence Q and is defined as the current pixel p\. From that pixel, the 
SelectNextPixel operation will be repeated again. If there is no pixel that satisfies 
both conditions, the result nil is returned when one side of tracking task ends in this 


47 


































































SelectNextPixel(p], 

begin 

1. 5 = Neighbors(pi, 0) 

2. while <5^0 

3. p 2 — LargestGradientPixel(5, X) 

4. if Consistent(V’, <^(p 2 )) 

5. return p 2 

6. <S = 5-{p2} 

7. return nil 
end 


Figure 45. Finding next pixel. 



(a) Two consistent directions (b) Inconsistent directions 

Figure 46. Consistency test. 


specific direction (0+ or 0-)- The description of the SelectNextPixel function is given 
in Figure 45. 


D. COMPUTING SEGMENT PARAMETERS 

The least-squares fitting method provides the representation (r, a) of the line 
segment that corresponds to the detected edge. However, some other information 
may be needed for further processing such as moo, the number of pixels tracked for 
the edge. For many applications including image understanding, an estimate of the 
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two endpoints of the line representing Q is also essential. For that purpose, we keep 
a record of the first and last pixels ( 91 , 92 ) = (Pi^Pn) = J/gi), (a:g 2 ) J/g 2 )) in the 

sequence Q. We use their projection points ( 61 , 62 ) on the line [Ref. 69]. They are 
computed as 


cat = {xgk — h cos a, ygk — Sk sin a), for A; = 1,2, 


where 


= Xgk COS a + Hgk sin a — r, for A: = 1,2. 
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V. A GLOBAL ALGORITHM FOR EDGE 
DETECTION USING RANDOM HITTING 


To detect all major edges in an image, we want to use the edge tracking 
algorithm described in the previous chapter. To obtain initial pixels to start edge 
tracking and to minimize processing of insignificant pixels, we introduce the use of a 
random-hitting method. 

The global algorithm for straight-edge detection is designed as shown in Fig¬ 
ure 47. A random position is computed for an initial pixel po (Line 3). If it is 
significant and is not close to any of the previously detected line segments (Line 4), 
the edge tracking and segment detection algorithm (Figure 41) is executed starting 
from Po. If the new line segment L is a major one, it is added to the set C of detected 
major line segments. This process is repeated until K line segments are obtained 
(Line 2). 


DetectEdges(Z,/\ ,£) 

Input : Image: X , estimated number of lines : K 
Output : Set of line segments: L 
begin 
1. £ = 0 

2. while \C\ < K do 

3. Po = RandomPosition() 

4. if ^ Significant(po) A NotClose(po, £) ) 

5. L= ComputeSegment(po,2’) 

6. if Major(L) 

7. C UL 
end 


Figure 47. Global algorithm for fast straight edge detection. 
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A. RANDOM HITTING 

1. Concept of Random Hitting 

Although the most straightforward approach for detecting all edges in an im¬ 
age is to examine all pixels, most of the pixels in a normal image are insignificant. 
Therefore, this method is extremely inefficient. To minimize the number of exami¬ 
nations of insignificant pixels, we compute (hit) a pixel position po randomly. If po 
is significant, the edge tracking algorithm, explained in the previous chapter, is exe¬ 
cuted. We adopt the linear congruential pseudo-random integer generating method 
[Ref. 71]. A pseudo-random number sequence is defined as follows: 

Rn+i = (aRn + c) mod M, n > 0, (V.l) 

where M is the modulus (M > 0), a the multiplier (0 < a < M), c the increment 
(0 < c < M), and Rq the starting value (0 < Ro < Af). 

A desirable property for a pseudo-random number generator is that no integer 
will be generated twice before the sequence of the pseudo-random numbers includes 
all non-negative integers less than M. The following theorem in [Ref. 71] describes a 
necessary and suflficient condition to the parameters a and c to satisfy this property. 

Theorem V.l The linear congruential sequence (V.l) defined by M, a, c, and Rq 
has a period length M if and only if 

i) c is relatively prime to M; 

ii) 6 = a — 1 is a multiple of p, for every prime p dividing M; 

Hi) b is a multiple of 4, if M is a multiple of 4- 

Recall that two integers a, b are said to be relatively prime if their greatest common 
divisor is 1. 

From now on, we let a=l and let c be a number relatively prime to M to 
achieve the maximum period length of M. Furthermore, we apply this pseudo-random 
number method to compute a sequence of pixel positions for an image oiW x H = M 
pixels. Each pseudo-random number R = Ri is then converted into a pixel position 
p = {x,y) by the transformation 

p = (a:, y) = [R mod W, R~ W), 
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where W is the width, in pixels, of an image. Then obviously a distinct pair of random 
numbers Ri and Rj are transformed into distinct pixel positions pi and pj, and vice 
versa. 

Because it is decided that a = 1, the next and last decision is on the selection 
of c. To further simplify the choice of c, we propose to choose two numbers j and k, 
with {0 < j < H) and (0 < ^ < W), where k is relatively prime to W. Then the 
value of c is given by 

c = j X W k. 

We applied this random hitting method to our edge-detection experiments. In our 
camera system, 

M = W X // = 646 X 486 = (2 X 17 X 19) X (2 X 3®) = 2^ x 3® x 17 x 19 = 313,956. 

Specifically, we chose j = 238 and k = 181, then 

c = j X W + = 238 X 646 + 181 = 182,999. 

To illustrate the concept, in Figure 48 the first ten pixels are shown with the order 
in which they are hit using this pseudo-random number scheme. The distribution of 
3000 pixels is shown in Figure 49. 

Another desirable property for this pseudo-random hitting scheme is that, if 
we divide an image into unit areas with the same size, the probability of being hit is 
approximately equal for each of them at any time. We found that the previous choice 
of c looks as if it satisfies this property of uniform distribution. This pseudo-random 
number scheme brought the experimental results presented in Section C. 

2. Edge Hitting Probability 

We discuss a theoretical analysis for soundness of adopting random hitting to 
perform edge detection. We consider an image X with a total of M pixels. How many 
hits are needed to detect a specific small region containing m pixels in X (Figure 50)? 
More precisely, what is the probability for the region to be detected after n hits? 
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Figure 48. Distribution of first 10 hits using pseudo-random number generator with 
c = 283 X 646-h 181 = 182,999. 


First we compute the function /(n), the probability for the region not being 
detected in n random hits. This “missing probability function” f{n) is computed as: 


M 


MJ 


Let Z = ^. Then 

m 




This is obviously a function of n. Specifically, let n — Z. Then 

Z 




It is well-known that 

Urn m = i, 

/i—^oo e 

where e is the basis of natural logarithm. Since m is a positive integer, the value 
of Z is always finite and f{Z) is never precisely equal to K However, for a realistic 
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Figure 49. Distribution of 3000 pixels using pseudo-random number generation with 
c = 283 X 646 + 181 = 182,999. 

value oi Z — f{Z) is very close to For instance, if Z= 100, the relative error 
between /(100) and ^ is approximately 0.5 % and the one between /(lOOO) and ^ is 
approximately 0.05 %. Thus, we can use ^ as a good approximation for f{Z) for a 
large value of Z. 

Now we explain how this analysis is related to the random hitting algorithm. 
Suppose we want to find an edge with a minimum length of 50 pixels . A region 
containing such an edge includes at least m=100 pixels. Assuming an image size 
of M=300,000 pixels and Z = -^=3000 times, then the missing probability function 
/(3000) ^ = 0.36788, and hence the detection probability is approximately 0.63212. 

Furthermore, if n = 2Z = 6000, then 



That increases the detection probability to 0.865. Even with ZZ = 9000 which is only 
3% of the total number of pixels, the detection probability is about 95%. Table I 
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Image 



Figure 50. An Image with total of M pixels having an m-pixels region, 
shows the approximation of detection probabilities for n = Z, 2Z, 3Z, 4Z, and 5Z. 


n 

missing probability 

detection probability 

z 

0.367 

0.633 

2Z 

0.135 

0.865 

3Z 

0.049 

0.951 

4Z 

0.018 

0.982 

5Z 

0.007 

0.993 


Table I. Approximation of detection probabilities for different n hits. 


With this analysis, for k regions in the image, with mi, m 2 , • • •, rrik pixels 
each, (Figure 51), let 


m = min m,-. 
i 


Then, using only 3 x ^ hits, almost all the segments will be detected. In the examples 
of results in this chapter, Section C, we tried to detect most of the major segments 
with at least 50 pixels length (m = 100) from a test image of size 313,956. After only 
2960 = 0,94 Z hits, 20 of an expected 24 of those major edges were detected. 
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Image 



Figure 51. An Image with total of M pixels having different regions. 

B. CLOSENESS TEST 

In this global edge detection algorithm, a randomly generated pixel po becomes 
a “seed” for a new segment if po is significant. However, when some line segments have 
already been detected, the newly generated pixel po may lie on one of these segments. 
We should avoid blindly tracking the same segment again starting from this pixel po- 
Therefore, each randomly generated pixel po must be examined to determine whether 
it is close to any of the previously detected segments. For this test, the distance 
d{po,L) between a given point po and a given line segment should be computed. A 
segment is represented as 

L = eT^= (xi,j/i) (x2,t/2), 

where ei and 62 are its endpoints with ei ^ 62 - Given a segment L, the plane is 
divided into three regions (Figure 52); Fi, V 2 , and Vq. Vi is the set of points p where 
Cl is the closest point on L from p. Likewise, V 2 is the set of points p such that 62 
is the closest point on L from p. Vq is the set of points p such that the closest point 
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Figure 52. Finding distance from point to line. 


on L from p is neither e\ nor 62 . Given p, if we know the region K containing p, the 
distance <i(p, L) is easily obtained. The 2D transformation group theory [Ref. 72] is 
applied to solve this problem of determining Vi. 

With the segment L, a local coordinate system is defined as follows. One of 
the two endpoints, say ei, is considered as origin Ol (Figure 53). Its local W-axis 
[xi) is aligned to the segment L, and the orientation 6 of xi is 

9 — atan 2 (i /2 ~ y\i ^2 ~ 2 : 1 ). 

Thus, the T-coordinate system is represented as qi = {x\,y\.,9)^ in the global image 
coordinate system. The pixel position p = (x,p) in the global coordinate system is 
converted into the L-coordinates (x*,t/*) as follows [Ref. 72]: 

X* = (x — xi) cos ^ + (p — Pi) sin 0 (V.2) 

p’ = -(x — xi) sin^ + (p - pi) cos (V.3) 

The local x-coordinate of the other endpoint 62 is, using Equation (V. 2 ), 

X 2 * = (X 2 - Xi)cos0 + (p 2 - Pi) sin 0. (V.4) 
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Figure 53. Pixel position in the L-coordinates 


By transforming (x, y) in the image coordinates into (x*,y*) in the L-coordinate 
system, the distance d(p, L) can be easily computed as 


d{p, L) = < 


d(p, ei) 

if x’ < 0 

in 

if 0 < X* < X2* 

d(p, 62) 

if X* > X2*. 


NotClose(p, C) 
begin 

1. x = r 

2 . while ^ 0 do 

3. L — LastSegment(A4) 

4. if d(p, L)<6 

5 . return (false) 

6 . M = M-{L} 

7. return (true) 
end 


Figure 54. Algorithm for closeness test. 

A threshold value 8 is assumed for the closeness test. If d{jpo^ L) < 8 for some 
previously detected line segment L, we consider po belongs to this segment L and 
the edge tracking is suppressed even though po is significant. If d{po, L) > 8 for all 
previously detected L, po is considered a new seed and a new edge tracking process 
is executed. The closeness test algorithm is shown in Figure 54. 
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C. RESULTS 


The new method for edge detection using direction-controlled edge tracking 
and random hitting, described earlier with its supporting functions, was implemented 
on a Silicon Graphics (TM) workstation and on the autonomous mobile robot system 
Yamabico-11 at the Naval Postgraduate School. This section presents some of the 
experimental results. First, start with an image of a square object as shown in 
Figure 55. The processed (examined) pixels, including the 608 randomly hit pixels, 
are shown in Figure 56. Figure 57 shows that all four edges of the square are detected 
by these 608 hits. This number is less than the total number of pixels in a single row 
of this image (646 pixels). 



Figure 55. A square object test image. 


For a sample indoor gray-scale image shown in Figure 36, we present the results 
of applying the algorithm. Figures 58, 59, 60, and 61 show the first 5, 10, 15, and 20 
segments detected by this algorithm. In these experiments, line segments having less 
than 50 pixels were discarded. The total numbers of random hits needed to obtain 
these segments are 77, 345, 985, and 2,960, respectively. Notice that these numbers 
are only tiny fractions of the total image size (313,956 pixels). Thus, by a relatively 
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Figure 56. Processed pixels before detecting all four line segments. 

small number of random hits, most of the major edges can be detected, as expected 
by the theoretical analysis. 

Table II gives more detailed numerical data of the twenty segments detected. 



Figure 57. All four segments are detected after 608 hits. 
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Figure 59. Ten line segments detected by 345 hits. 


Each row contains the data related to the zth line segment detected. For each i, Hi 
denotes the cumulative number of random hits done until we detect that line segment, 
n, the number of pixels tracked in the segment, eu and 62 , its two endpoints, £i the 


Figure 58. Five line segments detected by 77 hits. 
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Figure 61. Twenty line segments detected by 2960 hits. 


Euclidian distance between the two endpoints, a,- the orientation of the normal to 
the segment in degrees, and Ni the cumulative number of pixels examined in the 
whole process of detecting the first i line segments which also includes the number 


Figure 60. Fifteen line segments detected by 985 hits. 
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of examined pixels in segments which are not major. It was possible to detect five 


i 

Hi 

Tli 

A- 

^li 

^2i 

CVi 

Ni 

1 

9 

340 

339.014 

(517.308 , 138.994) 

(518.762 , 478.005) 

-0.246 

1020 

2 

36 

309 

307.996 

(236.693 , 444.002) 

(235.119 , 136.010) 

-0.293 

1953 

3 

45 

113 

112.001 

(573.715 , 312.000) 

(573.666 , 424.001) 

0.025 

2382 

4 

49 

141 

140.000 

( 5.899 , 142.000) 

( 6.002 , 2.000) 

0.042 

2820 

5 

77 

309 

308.001 

(551.092 , 409.009) 

(549.639 , 101.011) 

-0.270 

3951 

6 

98 

311 

310.000 

(476.939 , 444.999) 

(477.042 , 134.999) 

0.019 

4905 

7 

158 

316 

314.993 

(350.801 , 130.001) 

(352.952 , 444.987) 

-0.391 

5865 

8 

203 

203 

202.114 

( 33.941 , 440.269) 

(235.986 , 445.542) 

-88.505 

6543 

9 

249 

302 

301.015 

( 29.333 , 139.997) 

( 31.992 , 441.000) 

-0.506 

7527 

10 

345 

426 

424.004 

(605.151 , 59.999) 

(608.068 , 483.993) 

-0.394 

8820 

11 

503 

61 

60.008 

(136.598 , 389.994) 

(137.435 , 329.992) 

0.799 

9162 

12 

517 

98 

97.021 

(183.013 , 456.357) 

( 86.013 , 454.378) 

-88.832 

9456 

13 

600 

97 

96.047 

(245.001 , 149.018) 

(340.997 , 145.910) 

88.145 

9840 

14 

973 

131 

130.004 

(580.366 , 443.011) 

(581.390 , 313.011) 

0.451 

10611 

15 

985 

59 

58.026 

(169.100 , 333.057) 

(166.084 , 391.004) 

2.979 

10788 

16 

1028 

78 

76.987 

(198.791 , 395.978) 

(200.183 , 319.003) 

1.036 

11022 

17 

1744 

76 

67.140 

(549.703 , 104.654) 

(600.681 , 60.961) 

49.400 

11961 

18 

2622 

75 

74.076 

(598.522 , 409.938) 

(600.329 , 483.992) 

-1.398 

12825 

19 

2752 

79 

78.000 

(228.174 , 318.000) 

(228.180 , 396.000) 

-0.004 

13062 

20 

2960 

125 

124.008 

(351.995 , 443.870) 

(476.001 , 443.330) 

89.750 

13716 


Table II. Data of line segments obtained by the tracking algorithm. 


line segments by examining 3951 {=Ns) pixels, which is only 1.25% of the total pixels 
(313,956). Likewise, to detect 10, 15, and 20 line segments, 8820, 10788, and 13716 
pixels had to be processed, which are equal to only 2.8%, 3.43%, and 4.36% of the 
total pixels, respectively. 

The graph of Hi as a function of i is shown in Figure 62. The time to detect 
the {i + 1)®* line segment is generally larger than the time for the line, because the 
probability of hitting “undetected” segments becomes lower. 

Using several indoor scenes, we made experiments to find the distribution of 
the lengths of the edges detected from typical indoor images. We limit the maximum 
number of hits to only 3000 for these images. The following table shows the results 
for five test images taken in different positions in the hallway and in the lab. Each 
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Number of Segments i 

Figure 62. Number of random hits Hi to obtain first i segments. 

entry in the table shows the total number of obtained segments, and the number of 
lines in each of the three ranges of the lengths £ (in pixels) of these segments for each 
image. The results show that the majority of the line segments in the images have 


Image 

Num. of Segments 

75.0 <£< 100.0 

100.0 < £ < 300.0 

£ > 300.0 

1 

21 

2 

17 

2 

2 

21 

6 

12 

3 

3 

23 

6 

15 

2 

4 

10 

3 

6 

1 

5 

26 

6 

17 

3 


Table III. Number of line segments from several images. 


lengths in the range of 100-300 pixels, within which most of the landmarks in the 
indoor images fall. 

We also made experiments of detecting only vertical edges with a maximum of 
3000 hits, since vertical edges are essential in some of the image understanding tasks 
in robotics. The results obtained from three images sequentially taken in the hallway 
are shown in Table IV. Each entry in the table contains the number of vertical line 
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segments detected and how many hits {H < 3000) were actually needed for each 
image. The segments obtained are shown in Figures 63, 64, and 65. 


Image 

Num. of V. Lines 

Num. of Hits 

1 

13 

2810 

2 

12 

2857 

3 

15 

2553 


Table IV. Number of vertical line segments from a sequence of three images. 



Figure 63. Thirteen vertical edges detected by 2810 hits. 


Some line segments are detected due to reflections on the floor. These lines 
can be interpreted if we have a world model, the camera pose is known, and a proper 
pattern matching is performed. 

D. SUMMARY 

In this chapter, we have presented a global algorithm for a new efficient edge 
detection method that avoids exhaustive image scanning. We use a pseudo-random 
number scheme to find a starting pixel from which edge tracking for a linear sequence 
of pixels is performed. The directional information of the line segment is used to 
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Figure 64. Twelve vertical edges detected by 2857 hits. 

control the edge tracking. A robust least-squares fitting method is used to obtain the 
geometric features for the line segment that best fits the pixel sequence, including its 
endpoints. During this new edge detection method, redundant tracking of an edge 


Figure 65. Fifteen vertical edges detected by 2553 hits. 


67 




















is avoided by the closeness test. In real experiments, we have been able to detect 
most of the major edges in a test image of size 313,956 with a very small number of 
pixels processed (4.36% of total pixels). In other experiments on some indoor images, 
we detected more than twenty major edges, in the range of 75 pixels to more than 
400 pixel lengths, by less than 3000 hits. The algorithm also has been tested to 
detect vertical edges only. Most of the vertical edges in indoor images were detected 
by a total number of hits which is less than 1% of the total number of pixels in 
each image. The results obtained from these experiments show that this algorithm is 
efficient and useful for numerous image understanding applications and autonomous 
robot navigation. 
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VI. 


WORLD MODEL 


The geometrical model of the robot’s environment is essential for the inter¬ 
pretation of detected edges which can be used in the self-localization task. Also, the 
model is important for detecting obstacles, motion planning, and many other tasks 
for the robot. 

A. BACKGROUND 

Our efforts toward developing image understanding algorithms are aimed to 
be used by the autonomous mobile robots for model-based visual navigation. For the 
autonomous mobile robot Yamabico-11 at the Naval Postgraduate School, the current 
operating environment is part of the second floor in the Spanagel Hall building. In 
this chapter, we consider the problem of modeling that environment. Some of the 
capabilities needed for the model of the world are based on previous work done on 
the fifth floor of the same building (Ref. 36], as well as some techniques of computer 
graphics for perspective projection [Ref. 73]. By examining the old model, we noticed 
that some simplifications are needed for real-time navigation. We expect the following 
three major features for the new model: 

1. Simplicity 

2. Flexibility 

3. Accuracy 

A simple model structure has an advantage of saving computation time over a 
complex one. By flexibility, we mean that a user can easily modify some portions of 
the model, for instance, to identify additional obstacles in the world without changing 
the modeling algorithm itself. By accuracy, we mean that the model should represent 
the main landmarks of the environment precisely. This feature is important to support 
self-localization (pose determination), obstacle avoidance, and path planning with 
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minimal errors; the model must reflect exactly what the robot expects to “see” from 
its position and orientation. 


B. 2D POLYGONAL WORLD 
1. Polygons 

a. General Definitions 

Polygons are used as the basic building blocks for the world repre¬ 
sentation. A polygon B is represented by an ordered set of n distinct vertices 
{ui, U 2 , • • •, Un}, such that n > 3. Some examples of polygons are given in Figure 66 . 
A basic function tp used in representing a polygon is one that determines the next 






Figure 66 . Examples of polygons. 


>’2 


vertex for each vertex Vi. That function ip : B ^ B is defined as follows [Ref. 68 ]: 


ip{vi) 


u,+i, if 1 < z < n. 
Ui, if i = n. 


An example of such a polygon’s representation is shown in Figure 67. The polygon 
shown has four vertices Vi,V 2 ,vz, and V 4 where <,i>(vi) = U 2 , ‘^{'^ 2 ) = U 3 , ^(^^ 3 ) = V 4 
and 9 ?(u 4 ) = Ui- The next function is a bijection, and hence, there exists an inverse 
function : B ^ B such that determines the “previous vertex” of v. A 

segment v(p(v) is called an edge joining vertex v to its next vertex <p{v). 

Definition: A polygon B is called simple if it satisfies the following two 

conditions: 
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Figure 67. A polygon. 


1 . No triple of vertices (u, (^^(u)) in B are collinear, 

2 . There is no pair of non-consecutive edges sharing a point in B [Ref. 68]. 

In Figure 66, there are three simple polygons (a,b, and c) and one non-simple polygon 

(d). 

In our environmental modeling, only simple polygons are used. There 
are two types of polygon, based on the direction in which its vertices are traversed 
(Figure 68). 

• Counterclockwise (CCW) polygon, which is used to represent an obstacle in 
the operating environment. 

• Clockwise (CW) polygon, which is used to represent an outermost boundary 
of the operating environment. 

b. Data Structures 

A polygon is represented as a doubly-linked list of vertices. The repre¬ 
sentation of each vertex Vi has its own (x, y) coordinates, a pointer to its next vertex 
and a pointer to its previous vertex (Figure 69). 

In Figure 70, we show a data structure for the polygon shown in Fig¬ 
ure 67. 
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(A) CCW polygon (B) CW polygon 

Figure 68. CW and CCW Polygon. 

Vertex 

• x-coordinate 

• y-coordinate 

• pointer to NEXT vertex 

• pointer to PREVIOUS vertex 

Figure 69. Data structure of a vertex in a polygon. 


2. World 

A vehicle’s 2D environment is represented by a polygonal world model. Gen¬ 
erally, a world W is bounded by an outermost CW polygon and contains zero or more 



Figure 70. Data structure of a 4-vertex polygon. 
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of CCW polygons (obstacles) inside that boundary. An example of a polygonal world 
is given in Figure 71. 


World Boundary (CW Polygon) 


Obstacles (CCW Polygons) 


Figure 71. A simple polygonal world. 


а. World Data Structures 

A world model consists of a linked list of polygons. As described pre¬ 
viously, each polygon is represented by a doubly linked list of its vertices. Figure 72 
illustrates the general data structure used for the representation of a 2D world model. 

б . Yamabico’s 2D World 

For our vehicle’s environment, we choose to represent the 2D world with 
one CW polygon to represent the boundary of the floor. CCW polygons (obstacles) 
were not included in this stage of implementation. However, they can be added later to 
the model, if it is needed. The world coordinates are shown in Figure 73. Orientation 
of a camera or a vehicle with respect to the model is shown also. Each point on the 
floor where a vertical edge stands is described by its position (x, y) and pointers to 
its next and previous vertices. Figure 74 shows the actual 2D representation of the 
floor of the environment of the robot Yamabico. 
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World 



I 

polyg m 

Figure 72. Representation of 2D world data structure. 

C. EXTENDING 2D MODEL TO 3D 

Although the 2D representation is appropriate for motion planning, it is not 
enough for the image understanding that requires additional information from the 
environment. For example, some obstacles cannot be detected by certain sensors, 
such as sonars. The motivation of using a camera and a vision system is to provide 
information about those objects in the world that are difficult to detect with sonars. 
Also, sensing the walls and other 3D objects requires the addition of height information 
to the current 2D model. 

Due to the consideration of doors and walls as landmarks for visual navigation 
tasks, it is an important issue to represent their locations with respect to the 2D 
model. For that reason, they have to be modeled in a way that gives the interpretation 
of these landmarks when the vehicle performs its motion planning. Figure 75 shows 
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Figure 73. World coordinates and orientation of 2nd floor model. 



a simple 3D world consisting of a floor, ceiling, walls, door, and door ceiling. 

The environmental 3D model of the second floor in Spanagel Hall has one floor, 
three main ceilings, and several door frame “ceilings”. We represent each of those 
as a polygon. To obtain 3D modeling, a height value should be associated with each 
polygon in the model. Thus, the height value of the floor polygon equals zero, and for 
a ceiling polygon it takes the actual height of the ceiling measured from the floor. We 
call these polygons representing floor or ceilings world components. Connecting the 
vertices from the different world components provides the 3D model. For example, 
connecting those vertices on the floor to their corresponding vertices on the ceiling 
of a door frame by vertical edges gives the information on the door frame. A simple 
example is shown in Figure 76. In this figure, the door frame is described by a sub¬ 
polygon {B,C,D,E}, a polygon {G,H,I,J}, and vertical edges BG.,CH,DI, and EJ. 
Additional description about 3D model are given in [Ref. 36]. 
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Figure 75. A simple 3D world. 


D. 2iD MODEL 

A complete 3D model may well be suitable for computer graphics applications. 
However, for robot navigation, representing detailed descriptions of environments such 
as ceilings may practically not be needed. Thus, we propose simplifying the model of 
the environment to gain more efficiency when it is interpreted by some visual tasks of 
the robot. We consider the world as a floor, plus walls and door frames only. The walls 
and doors are represented as vertical edges extending from the floor. We call that 
model a 2|D model. We expect that other portions of the world are not essential. 
Figure 77 shows the 2|D representation of the 3D world shown in Figure 75. 

That model is mainly the 2D representation of the floor, plus the information 
of the height of all model vertical lines extending from the floor. A world view should 
show only the vertical model edges. This is intended to save the computational time 
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Figure 76. Representation of a door frame in the hallway model. 

required to build a complete 3D model which includes details not relevant to the 
visual navigation of a robot. For example, the ceilings are higher than the vehicle 
and representing them is a waste of storage and increases the processing time. In 
Figure 78, the description of the data structure for a vertex in this model is given. 
The new model is obtained by: 

1. Eliminating any additional representation in the model data structures other 
than the 2D polygonal world of the floor, such as ceilings. 
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Figure 77. A simple 2|D world. 


2. Adding the height (z-value) of any vertical edges extending from each vertex 
in the floor. 

3. Eliminating complex functions that deal with 3D geometrical structures (such 
as constructing ceilings and connecting their points to their correspondent 
points in the floor by vertical edges). 

4. Computing model vertical lines directly from the data structures of the floor 
vertices. 

In APPENDIX: WORLD MODEL, the model description of the current operating 
environment of the autonomous robot Yamabico-11 is given. 


E. TRANSFORMING MODEL INTO 2D VIEW 

The robot must And a 2D scene from a given viewpoint po and the world 
model. After defining the data structures of the model, some graphics functions for 
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Vertex 

• 

x-coordinate 

• 

y-coordinate 

• 

z-value of its vertical edge 

• 

pointer to NEXT vertex 

• 

pointer to PREVIOUS vertex 


Figure 78. Data structure of a vertex in a polygon within the 2|D model. 

transforming the model edges into 2D view line segments can be used to perform that 
task [Ref. 36, 73]. This task includes the following steps: 

• Finding the visible points of the hallway floor from po for the 360° angle, 
regardless of its orientation. 

• Finding the vertical edges extending from the visible floor points obtained in 
the previous step. 

• Specifying the 2D viewing parameters. 

• Projecting extracted features (lines) into the 2D viewing window. 

The problem of visibility is to determine a set of visible points in the world 
from the given position of the viewing point. We call this process point visibility. To 
illustrate this concept, a simple example of a polygonal world is given in Figure 79, 
which shows the visible points from the viewing point po marked by a black circle at 
each. Several methods for visibility testing can be used. In our implementation of 
the environmental modeling for Yamabico, some of the vision system library (on an 
SGI machine) included in [Ref. 36] are portions used for the visibility testing. 

The perspective projection is a well-known technique in computer graphics and 
computer vision. The viewing volume for a perspective projection is called a frustum, 
or a truncated pyramid [Ref. 74]. This is shown in Figure 80. Usually the window 
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Figure 79. Example of point visibility. 


width and height are the same as the dimension of the image produced by the camera 
of the vision system. Values of near clipping and far clipping planes should be chosen 
appropriately to give the viewer a good sense of the spatial relationship between 
different parts of the world [Ref. 73]. One strategy in the modeling is to set the near 
clipping value to the focal length of the camera’s lens and the far clipping to be larger 
than the distance between the origin of the world and the farthest point in the world 
from the origin. The near clipping plane is used to clip objects too close or behind 
the viewer (camera), while the far plane clips those objects too far to be seen [Ref. 
75]. 

The projection from model lines to 2D view includes several steps, as shown in 
Figure 81 and described in [Ref. 73]. They are provided for the vision system library 
and implemented by [Ref. 36]. 

However, in the case of the on-board image understanding algorithms for the 
vehicle, the result is not intended for display. It will be stored in data structures 
defining the information of the virtual line segments that are expected to be viewed. 
The information for each line segment should include its two endpoints within the 
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Figure 80. Truncated perspective viewing volume. 


view plane. This will be used for the matching procedures against the actual line 
segments detected from images taken by the vehicle’s camera. 


F. MODELING RESULTS 

To implement the model of the second floor on Yamabico and integrate this 
implementation with the current vision system and MML, it is necessary to test the 
model visually through a graphical workstation to check whether it correctly reflects 



Figure 81. Steps of 3D world coordinates to 2D view projection. 
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the basic features of the environment. In this implementation, both the 3D model 
and the 2|D model were tested from several estimated positions. 

Figure 82 shows one 3D model view from a position of the camera estimated by 
a;=2032 cm, j/=124.25 cm, with the viewing orientation of 0 degree (as if the camera 
is positioned in the front of the AI and Robotics lab). The field of view angle was set 
to 64°, while the focal length was 3.0 cm. The view of the 2|D model from the same 



Figure 82. Hallway view from x=2032 cm, j/=124.25 cm, 0 = 0°, view angle=64° and 
focal length=3.0 cm. 

position using the same focal length and field of view angle is shown in Figure 83. 

By changing the value of x to 3000 cm, as if the camera were moved 968 cm 
from its previous position, the view of the 3D model shown in Figure 84 was obtained. 
The corresponding view of the 2|D view is shown in Figure 85. 

This method was efficient enough. Through these experiments, we concluded 
that the model can be added easily for the autonomous mobile robot Yamabico’s 
motion planning tasks. One interesting application for which we want to use that 
model, together with the new efficient edge detection method, is to obtain a fast 
vision-based pose determination algorithm, as we will describe in the next chapter. 
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Figure 83. Vertical edges of the hallway using 2|D model from the same view point 
as in the previous figure. 



Figure 84. Hallway view of the 3D model from a;=3000 cm, y =124.25 cm, 0 = 0°. 
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Figure So. Vertical edges of the hallway using 2|D model from the same view point 
as in the previous figure. 
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VII. APPLICATION OF EDGE TRACKING 
AND MODELING: VISION-BASED POSE 
DETERMINATION 

A. INTRODUCTION 

In the previous chapter, the modeling of the environment in which an au¬ 
tonomous vehicle navigates was described. One of the tasks to be addressed with 
such a model is the correction of the position and orientation of the vehicle with 
respect to the model. This is one of the important problems for mobile robots. A 
common method of positional identification for an autonomous vehicle is odometry, 
or dead reckoning. However, this capability is affected by accumulated errors due to 
wheel slippage and an uneven floor. Due to these effects, uncertain estimates of the 
position and orientation may affect the vehicle’s motion. For the autonomous mobile 
robot Yamabico, localization using ultrasonic sensors, 2D transformation, and linear 
fitting of the sonar return points [Ref. 11] was successfully performed. Some work 
has been done on a graphics workstation using a complete 3D model of the previous 
environment of the vehicle (fifth floor of Spanagel Hall) and an exhaustive-scanning 
edge detection method [Ref. 35]. However, by making experiments based on that 
work, we found that it takes 19-20 seconds to correct the estimated pose. Thus, we 
decided to solve the problem in an improved and more efficient way so that it will be 
suitable for further visual navigation tasks of the robot in its current environment. 

Our work is mainly aimed at efficiency and robustness improvement in prac¬ 
tical image understanding. Through our proposed improved solution to the problem, 
the work mainly includes the following two significant approaches: 

1. Applying our new edge detection algorithm using direction-controlled edge 
tracking and random hitting [Ref. 70], as described in Chapters IV and V. 

2. Using the proposed 2|D model rather than using a complete 3D model. 

Our geometrical solution to the problem is presented in this chapter as well. 
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B. PROBLEM STATEMENT 

The problem of pose determination and correction for an autonomous robot’s 
visual navigation.in an indoor environment can be stated as follows; Given an es¬ 
timated pose 9e = {^eiye-,Ze- Pei^ei^e) where pe, (fe, and Og are the roll, pitch and 
yaw angles of a camera, an input image I taken by the camera, a description of the 
environment (world) model M, and the focal length / of the camera, compute the 
correct pose q of the camera (Figure 86). We assume that if the optical axis of the 
camera is parallel to the floor( Pe = 0 and (pe=0), then the vertical edges in the real 
world will appear vertical in the image. We call the positions in the 2D world where 
the vertical edges stand control features. 

Input Image L _ 

Estimated Pose ^ 

World Model 

/ 

Focal Length 
of the Camera 

Figure 86. Pose determination problem 

C. OVERVIEW 

First, we adopt the solution of the problem of point location using distinguish¬ 
able marks, proposed by K. Sugihara ([Ref. 64]). The solution idea is used as a basis 
for the overall algorithm of the pose determination and correction. 

1. Sugihara’s Formulation and Solution of Point Lo¬ 
cation Problem (PLP) 

Let pi, p 2 ,..,Pk be points on a plane and B = (e; rj,, ...,rfc) be a bunch of rays. 

Find the motion M such that the ith ray in M{B) passes through p,- for all 



Correct Pose 
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i = 


In [Ref. 64], the rays are considered to represent the directions in which a set of 
k control features (vertical edges) in the world are observed. Additionally, consider 
a situation where these rays are passing through image plane’s extracted vertical 
edges. The solution addressed there was that if A:=3, the camera position is usually 
determined uniquely. 

2. Camera Pose Determination as PLP 

We think of the motion M as the pose of the camera (robot) defined by : 

• The position v of the camera in the plane, where the rays ri, r 2 ,..,r;t are 

originating from v to some other points (pi, p2y,pk) in the plane. 

• The orientation 9 of the camera heading. 

To apply the solution idea in [Ref. 64], let = Lp\vp2 be the viewing angle 
from V to Pi and p2i and 02 = Ip2vp3 be the viewing angle from v to p2 and ps. 
Since we have two constant angles 0i and 02-, and there is only one possible point 
in the given world which has viewing angles 0i and 02 between the control features 
Pi, p 2 , and p 3 , then the unique position should be the precise camera position. It is 
determined by a point v on the intersection of two circles, one passes through pi and 
P 2 such that the angle of the arc pi p 2 is equal to 0i and the other passes through p 2 
and p 3 where the angle of the arc p 2 ps is equal to 02 (Figure 87). 

D. ALGORITHM 

The algorithm is described in Figure 88. Using the 2|D model, we find the 2D 
view Ml of the 2|D model M that represents those vertical edges that are expected 
to be seen by a camera with a focal length / at a given pose (Line 1), as in 
Figure 89. A vertical line in this 2D view should correspond to a single vertical edge 
in the modeled world. Using our straight edge detection method, we find a set M2 
={^i) hi ^ 3 } of the detected vertical line segments L (Lines 2 and 3). 
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Pi P2 



Figure 87. The intersection of two circles passing through three control features 
uniquely determines the camera position. 


CorrectPose(/, M, J, qe) 
begin 

1. Ml = ProjectModel(9e?/j Af) 

2. £ = detect Edges(J) 

3. M 2 = Longest3VerticalEdges(£) 

4. for (each U 6 M 2 ) 

5. e,- = FindBestMatchedModelVerticalLines(/,', Ml) 

6. Pi = 2DworldCoordinatePosition(e,) 

7. Pi = ViewAngle(/i, h, /) 

8. P 2 = ViewAngle(/ 2 , h-, f) 

9. V = FindActualPose(pi,p25P3; A5/^2)9e) 

10. 9 = CorrectOrientation(/, u,pi,P2,P3,2’, M 2 ) 

11. return q = (u, 9) 
end 


Figure 88. Overview of a fast pose determination algorithm. 
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Figure 89. Vertical edges of model expectation 2D view. 


To find the correspondence between the features in the 2 |D model of the 
environment and those that exist in the image, it is expected that there exist three 
vertical edges Ci, 62 , and 63 in the real world defined by the given model M that 
best match those three vertical image lines in M 2 (Line 5). The matching process is 
dressed within the old method in [Ref. 35] and is used in this application. 

After obtaining the 2D world coordinates of the positions ^ 1 ,^ 2 , and pz on the 
floor for each of these three model vertical edges (Line 6 ), the viewing angles from 
the estimated position to those three points are computed. Let x \, X 2 and xz be the 
x-coordinates of the intersection point of each vertical image line /; (for 1 < f < 3) and 
the horizontal projection of the image plane. Furthermore, let xq be the x-coordinate 
of the image plane’s center. Since the distance to the image plane from v is the focal 
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Figure 90. Viewing angles from the camera position v to three positions on image 
plane lying on the three vertical image lines. 

length, the viewing angle /3i (between image lines and I 2 from v) and the viewing 
angle ^2 (between I 2 and Iz from u) can be computed (Lines 7 and 8) as follows : 

ft = tan-(^1^) - tan-(^1^) , 
ft = 

As stated earlier, if pi, p 2 , and pz are three control features, then it can be 
expected that /?i = Lpivp 2 be the viewing angle from the actual camera’s position 
V to Pi and p 2 , and /?2 = ^Pavpz be the viewing angle from v to p 2 and pz- Given 
the coordinates of the three control features and the viewing angles and /? 2 , the 
position V can be computed, as we will describe in the next section. 

1. Finding Correct Position 

The inputs to this task are the three positions pi,p 2 , and pz of the world’s 
vertical edges, the computed viewing angles and /? 2 , and the estimated pose qe 
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Figure 91. Viewing angles from v to three control features in the world. 
(Figure 92). The output is the {x,y) coordinates of the correct position v. To find 


Precise Pose 

- 5 »- 

V 


Figure 92. Finding precise position of the camera, 
the correct position of v, we will use Figure 93 and the following steps: 

1. Let El = lpiP 2 with orientation Qi = ^(pi,p 2 )? and E 2 = pTpz with orientation 
02 = 'I'(p2,p3). 

2. Compute the center ci of the first circle passing through pi and p 2 as follows: 
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Figure 93. Geometrical relation between the correct position v and the two circles 
passing through the three control features. 

• find the middle point = {xq^,yq^) = ( ^ 1 +^2 ^ y\+n ^ of pip 2 , where pi = 
(xi,t/i) and p 2 = (a; 2 ,y 2 )- 

• compute si = 

• the center is computed as follows: 


Cl 


(xgj + Si cos(ai - ^), + Si sin(ai - ^)) 

(x,, + Si sin(ai),y,i - Si cos(qi)). 


3. Compute the radius J?i of the first circle: 

P ^ 

^ 2 sin/?i 

4. Similarly, the second circle’s center C 2 and radius R 2 is computed. 

0 . Given the two centers Ci = (xi,yi) and C 2 = (x 2 , 2 / 2 ), and the two radii Ri and 
i? 2 , the exact position v = {x,y) should satisfy the conditions 

{x-x,f^{y-y,f = Rl (VII.l) 
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and 


(x - X 2 f + {y- vt? = Rl 


From these two equations, we get 


- 2xxi + + 2/2 - 22/2/1 + y'i = R\, 


2 _ d2 


(VII.2) 


(VIL3) 


and 

x2 - 2 xx2 + Xj + 2/^ - 22/2/2 + = Rl- 

By subtracting Equation VII.4 from Equation VIL3 we get 


(VII.4) 


- 2x(xi - X 2 ) + xl-xl- 22 / 2/1 + 22 / 2/2 + yl-yl = Rl - R-l, (VII.5) 


which yields 


^ ^ xi + X 2 2/1 - y 2 yl - yl 

2 2(xi^X2)^ 2(xi — X2) 

= Ki+K^y, 


p2 

2(^1 ^ 2 ) 


(V 11 . 6 ) 

(VII.7) 


where 


^ a;i + X2 yl-yl-Rl + R^ 

2 2(xi-X2) 


and 


lu = 


yi - 2/2 


2(xi-X2)' 

Solving for Equation VII.3 and substituting for x in the equation, we obtain 


{Ki + I< 2 y) - + I< 2 y)xi + Xj + 2 / - 22 / 2/1 + 2/1 = i?i, 


or, equivalently, 


Kl + 2 KiK 2 y + K^y^ — 2 K.\X\ — 2 K 2 X\y + Xj + 2 /^ — 2 yy\ + 2/1 ~ R\ — 0 


,.2 d 2 


NoteThis equation is in the form 


Ay‘^ + ^2/ + C = 0, 
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where A = /i| + 1,5 = 2{KiK2 — K 2 ^\ — yi), and C = — 2/\iXi + xj + j/j — R^. 

By computing the constants Ki and K 2 and using them with the other known 
values of Xi, j/i, and i?i, we can substitute for A, B, and C in the two solutions of the 
second order equation which are y = —B ± • One of the values of y should 

be very close to that of the point P 25 so the other value is taken. Using that value of 
y in Equation VIL7, x can be computed. Now the correct position is u = (x,y). 

2. Finding Correct Orientation 

After finding the correct position v — (x,y), the correct orientation, 0, is 
computed as follows: 

For each control feature p,, (1 < f < 3), the orientation ^(u,p,) is computed. 
Next, the angle 8i is computed using the focal length, Xj-values for each corresponding 
vertical line on the image plane, and xq, the x*coordinate of the image plane’s center 
(Figure 94). Then the difference 9i = ^(u,p,) — 6i is computed. Finally, 6 is obtained 
by averaging these values. The algorithm is given in Figure 95. 



Figure 94. Geometry of finding camera orientation. 


E. EXPERIMENTAL RESULTS 

Using the model of the new environment of the autonomous robot Yamabico, 
the algorithm was tested on some images taken by a CCD camera. We estimated the 
position and orientation in which each image was taken. Several experiments were 
performed to verify that the algorithm returns the correct pose. The difficult part 
was specifying the focal length to obtain the correct model view that will be matched 
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CorrectOrientation(/, Pi,P 2 ,P 3 , xq, xi, X 2 , xs) 


begin 

1. for (each pi, 1 < i < 3) 

2. = Orientation(t>,pi) 

3. 6, = tan-i ( 2 ^) 

4 . = Si 

5. return 6 = Average(0i,02)^3) 
end 


Figure 95. Algorithm of computing the correct camera orientation 6. 


with the actual image. In Figure 97, we show an example of correcting the estimated 
pose in Figure 96. 



Figure 96. Estimated pose x=2040 cm, y= 120.0cm, 9 = 0.0°. 


The magnitude of the error for that position was 10.77 cm for x, 3.77 cm 
for y and 1.97° for 0. The execution time on a 20MHz SGI machine was about 4- 
5 seconds which is about 20% of the execution time of the old pose determination 
method. Thus, we believe that a significant improvement was achieved by using the 
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Figure 97. Correct pose a:=2050.77 cm, y= 116.23, 6 — 1.97°. 


new efficient edge detection algorithm and the modeling method. 



























VIII. IMPLEMENTATION PLATFORM: 

YAMABICO-11 


In this chapter, a brief description of the hardware and software system ar¬ 
chitecture of the robot Yamabico-11 is given. The on-board image understanding 
system is also described here. This is intended as background information about the 
platform on which the research in this dissertation was directed. 


A. YAMABICO HARDWARE SYSTEM 

The autonomous mobile robot Yamabico-11 (Figure 98) is an experimental 
robot used for indoor robotics research at the Naval Postgraduate School. The efforts 


Figure 98. Autonomous mobile robot, Yamabico-11. 


of a research team led by Professor Yutaka Kanayama have added many significant 
contributions to that platform over several years. Many research activities have been 
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involved in the project, including motion planning [Ref. 9, 10, 11], automated car¬ 
tography [Ref. 33], sonar-data interpretation [Ref. 18, 19], and sonar-based obstacle 
avoidance [Ref. 34]. Additional contributions have been made at the University of 
Electro-Communications, the University of Tsukuba, Stanford University, and the 
University of California at Santa Barbara [Ref. 76, 77]. 

For image understanding, several efforts constructed functions that can be 
used for further image understanding tasks ( [Ref. 35, 36]). A vision system was 
added to the robot, as described in [Ref. 38]. The Yamabico hardware system is 
described in Table V. Some of the technical information in this table is based on 
[Ref. 78]. The hardware architecture of Yamabico is illustrated in Figure 99. 


Power Source 

Two 12-volt batteries 

CPU 

IV-SPARC, 33MHz 

RAM 

16 MB DRAM 

MIPS 

24 

MFLOPS 

5 

CPU Interface 

VMEbus Interface 

EPROM 

Two 64K X 16 EPROMs 

Driving Wheels 

2 

Free Wheels 

Four casters (for balance) 

Communication With 
UNIX System 

Ethernet (lOBase-T) 

Sonar System 

Sonar Beam Range 

Sonar Coverage 

Twelve 40KHz sonars (3 groups) 

4 meters (approximately) 

360 degree 

Motors 

Two 35 watt DC motor with shaft encoders 

Motor Control 

Dual Axis Controller 

Vision System 

image board with CCD camera (B&W) 

I/O Interface With Users 

Macintosh PowerBook 


Table V. Yamabico’s main hardware specifications. 


B. ON-BOARD VISION SYSTEM 

In this section we give a brief description of the on-board vision system. The 
installation of an image board and a CCD camera on the robot was an important 
step in Yamabico’s development. That was the initial step in the on-board image 
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Figure 99. Block diagram of Yamabico-11 hardware architecture. 


understanding system. The efforts of J. Kisor [Ref. 38] in integration of image 
hardware/software became the groundwork for the future image understanding tasks. 
The main references for this section were his thesis and the technical manuals of the 
image board and the camera used. 

1. IMS Image Board 

The IMS (standard image manager) is a single VME board containing several 
components such as image memory, acquisition module (AM), computational module 
(CM) and display module (DM). These modules provide flexible camera and display 
interfaces. Additionally, they provide local processing capabilities. The acquisition 
module provides an interface between the camera and the IMS board. The display 
module provides an interface with a display monitor [Ref. 79]. The image board is 
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mapped to the addressing space of the VME bus as shown in Table VI. The IMS 


Module 

Address 

Image Memory Frames or Display Module 

OxfaOOOOOO 

IMS registers 

oxfcoooeoo 


Table VI. Mapping of image modules to the VME bus address space. 

image memory stores digitized images at three frames; AO, Al, and Bl. Each frame 
is 1024 pixels x 1024 lines, with an eight-bit per pixel. 

2. CCD Camera 

The image understanding system on Yamabico includes a Cohu 4110 CCD 
camera from Cohu Inc [Ref. 80]. The Cohu 4110 is a digital output camera featuring 
a 1/2 inch format CCD image sensor. The area of active imaging is 6.4mm x 4.8mm 
and 739 x 484 pixels. Table VII shows the main characteristics of the Cohu camera. 
For more information see [Ref. 80]. . 


imager 

single CCD 

Image Sensor Size 

6.4mm x 4.8mm 

Video Output 

digital with analog option 

Resolution 

739 X 484 pixels 

Num. of bits/pixel 

8 bits 

Gray Levels 

256 

Focal Length 

variable (depends on the lens used) 


Table VII. Summary of Cohu CCD camera capabilities. 


C. MML SOFTWARE 

The Model-based Mobile-robot Language (MML) is a library of functions writ¬ 
ten in the ANSI C language in the UNIX environment. This library is used as a basis 
for all user programs and commands to the robot. It supports motion control func¬ 
tions, sonar functions, I/O functions, image understanding functions, and others. The 
last version developed for Yamabico is called MML-11. A new version (MML-12) for 
motion control is currently under development. The image understanding portion of 


100 














the software was integrated with MML-11 for experimental purposes [Ref. 38]. In 
the future, it should be integrated with the new version MML-12. 

When a user writes a program and wants to execute it on Yamabico, the 
program takes the form of a user file called user.c which includes some commands to 
the robot that should be performed sequentially. After compiling the program, the 
executable file is ready to be downloaded to the robot through an Ethernet connection. 
After downloading the program, the robot becomes self-contained. 

The function library is a set of user functions to provide the interface between 
the user and the system. Before adding the image understanding functions, the user 
functions were categorized into four modules [Ref. 11]: 

• Operating System Module, 

• Motion Planning Module, 

• Motion Control Module, 

• Sonar Control Module. 

An important feature in the software system of Yamabico is its interrupt han¬ 
dling. The motion control has the highest priority. The interrupt occurs every 10 
msec. The sonar control can issue an interrupt request every 50 msec in the case of 
the existence of sonar returns. The user programs have the lowest priority, but it 
gives the system all the requested user commands through a sequential buffer. 

D. IMAGE UNDERSTANDING SOFTWARE 

The image understanding software was added first to the robot as a part of 
the work addressed in [Ref. 38]. Some low-level image functions were included with 
the system through that work. In the following, we list those main low-level image 
functions: 

1. IMS initialization : for initializing the image board. 

2. Display Initialization : for setting the monitor and display module registers. 
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3. Acquisition Initialization: for setting registers that control the camera or ac¬ 
quisition module interface. 

4. Image Transfer (setInputPath, setFrameAcquire, acqEnable, and interlacePage2). 

1. Basic User-Level Image Functions 

Adding an image understanding capability to the robot requires the addition 
of an image function library. It is intended to hide the low-level functions from the 
user, who may then issue user-level commands in place of a sequence of low-level 
functions. These low-level functions were originally designed by John Kisor [Ref. 38]. 
Specifically, the functions acqEnable and interlacePage2 can be categorized in the 
user-level ones. All the user-level commands are contained in the Appendix: ON¬ 
BOARD IMPLEMENTATION, Part 4. The added user-level commands are shown 
in Table VIII. 


Function 

Purpose 

initImgSys() 

Initialization of the image frames 

setCamera() 

set the input path from the camera to the 3 image frames 

start Grab 0 

direct the image board to start grabbing 

continuous video image from the camera to image frames 

stopGrab() 

direct the image board to stop grabbing the video image 

snap() 

take a snapshot of a still image into image frames 

clear Frame (frame) 

set all pixel data in a specified frame to 0 

clearFrames() 

set all pixel data in all frames to 0 


Table VIII. Summary of basic user image functions. 
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IX. CONCLUSIONS AND DIRECTIONS 
FOR FUTURE WORK 

A. CONCLUSIONS 

This research has solved the efficiency problem for the model-based image 
understanding tasks of autonomous vehicles. A new efficient straight-edge detection 
method using direction-controlled edge tracking and random hitting algorithms was 
invented for the purpose of reducing the computational time. This method avoids 
the exhaustive pixel processing executed in classical edge detection methods. We 
proposed using a pseudo-random number scheme to find a starting pixel from which 
an edge tracking for a linear sequence of pixels is performed. A robust least-squares 
fitting algorithm was used to obtain the geometric features for the line segment that 
best fits the pixel sequence including the endpoints. The directional information 
of the line segment was used to control the edge tracking task. During this edge 
detection method, duplicate tracking of the same edge was avoided by the closeness 
test. 

In real experiments, we were able to detect 20 major edges in a test image 
with a very small number of pixels processed (4.36 % of 313,956 total pixels in the 
image). The results show that this algorithm is useful and efficient for numerous 
image understanding applications and autonomous robot visual navigation. 

An effective method for modeling the world environment of an indoor au¬ 
tonomous vehicle is presented to give the main features of the world in a simple and 
efficient manner. The two methods were used to improve the efficiency of the pose 
determination algorithm. The execution time was only about 20-25% of that with a 
3D model and exhaustive scanning method for edge detection. 

These methods were first successfully implemented on a graphics workstation. 
Second, the code was partially ported to the autonomous mobile robot Yamabico-11. 
A user-level image function library was added to the current MML software system. 
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so that a user can easily program high-level on-board vision algorithms. 


B. FUTURE WORK 

In the following points, some suggestions for future work: 

• Investigating different values to be used for c in the pseudo-random hitting 
method that satisfies the uniform-distribution property, and finding a mathe¬ 
matical model that computes the best scheme for the pseudo-random number 
generator, based on the image width and height. 

• Investigating the use of the new edge detection method with multi-resolution 
images, as those dealt with in [Ref. 81]. 

• Implementing the 2|D model and pose determination in real time for self 
localization. 

• Including the presented direction-controlled edge-tracking method with an ob¬ 
stacle avoidance method may result in a robust, intelligent behavior of the 
robot. 

• Investigating sonar/vision fusion to verify the localization correction. 

• There were some minor problems in edge detection in some indoor images due 
to the reflection and lighting conditions. Solving this problem will improve 
this edge-tracking method. 

• Complete implementation of the image-understanding function library. 
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APPENDIX A. ON-BOARD 
IMPLEMENTATION 


The new edge detection algorithm was implemented on the autonomous mo¬ 
bile robot Yamabico-11. We present the data structures and the program design as 
implemented on the robot Yamabico. The program design is described in Figure 100. 



Figure 100. User program description. 


In this user program, a call to the function initlmgSys performs initialization 
of the image frames AO, Al, B1 to zero. The function readimage sets the camera 
signals to be the input to the image board, then starts grabbing an image into frames 
AO, Al, and Bl. After setting the appropriate memory location for the acquired 
image, a call to the function detectEdges() is applied on the image to perform the 
edge detection task. The structure of readlmage() function is described in Figure 101, 
while the structure of detectEdges is shown in Figure 102. 



Figure 101. Structure of the readlmage() function. 

An example of a user program is given below, followed by the data structures 
and the edge detection functions. 
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Figure 102. Edge detection program description. 


1. AN EXAMPLE OF USER PROGRAM 


FILENAME : user.c 

DESCRIPTION: read image and detect edges using 

Direction Controlled Edge Tracking and 
Random Hitting 
Author : Khaled Morsy 


#include "user.h" 

#include "displayCtrl.h" 
#include "acquisitionCtrl.h" 
#include "imsControl.h" 
#include "imgLib.h" 

#include "trackEdge.h" 


/**** Local Prototypes 


int 

userO { 

initlmgSys(); 
readImageO ; 
detectEdges(); 
stopGrabO ; 
logLinesO ; 
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2. DATA STRUCTURES DEFINITIONS 

FILE NAME : imgDefs.h 

DESCRIPTION: Data structures Definitions for image 

understanding programs, specifically for the 
new edge detection algorithm. 

AUTHOR : Khaled Morsy 

/**♦*♦***♦♦♦*♦*♦♦♦♦♦♦♦♦*♦ =|t********J)c**^>(: **♦♦♦*♦*♦♦ **♦♦*♦ / 

typedef struct 

{ 

POINT p; 
double gm; 
double phi; 

} PIXEL; 

typedef struct 
{ PIXEL pi; 

PIXEL p2; 

PIXEL p3; 

} GROUP; 


typedef struct 
{ 

/* Least Squares Fit moments*/ 


double 

mOO; 

/* 

double 

mlO; 

/* 

double 

mOl; 

/* 

double 

mil; 

/* 

double 

m20; 

/* 

double 

m02; 

/* 

double 

mux ; 

/* 

double 

muy ; 

/* 

double 

alpha; 

/* 

double 

r; 

/* 

POINT 

qi; 

/* 

POINT 

q2; 

/* 

} EDGE; 




of IMG LINE ♦/ 
distance from c 
segment */ 
last pixel of i 
direction */ 
last pixel of < 
direction */ 
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/* same as mOO */ 


typedef struct 
{ double n; 
double alpha; 
double r; 
POINT el; 
POINT e2; 

} IMG_LINE; 
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3. FUNCTIONS OF THE DIRECTION-CONTROLLED 
EDGE TRACKING WITH RANDOM HITTING PRO¬ 
GRAM 

/ *♦****♦**♦**♦♦**♦**♦**♦♦*♦♦**♦*:(=♦♦*♦♦****!(;*♦♦***♦;♦:*♦:** 

FILE NAME : trackEdge.h 

PURPOSE : Edge detection functions 

DATE : Apr. 9, 1997 

Notes : Yamabico Version. 

AUTHOR : KHALED HORSY 


Sdefine THRESHOLD 5000 
#include "imgDefs.h" 


/*** Global Variables */ 

double phiO , gO, phil; /♦ pO’s gradient direction and 

magnitude */ 


int temp = 1; 
int H =476; 
int W = 732; 

/sit**:**:*!***:*:*****:*:**:*;********* PROTOTYPE **************/ 
long PseudorandomClong,long,long,long) ; 

POINT ConvertTo2D(long); 
int NotOnImageBoundary (POINT) ; 
int significant(POINT); 

void ComputeGradient(POINT, double *, double * ); 
double SumSqrs(double , double); 

POINT FindInitialPixel(POINT); 

void TrackEdge(POINT,EDGE ,IMG_LINE*); 

void InitializeSequence(POINT , EDGE ♦) ; 

double normalize2(double) ; 

void TrackPixels(POINT,double,EDGE *) ; 

POINT SelectNextPixeKPOINT,double) ; 

GROUP Neighbors(POINT , double); 

PIXEL LargestGradientPixel(GROUP); 

int Consistent(double ,double ); 

void DeletePixel(PIXEL, GROUP*); 

void LeastSquaresFitting(EDGE *,POINT); 

void ComputeSegmentData(EDGE * , IMG_LINE *); 

double normalize(double) ; 
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void InitLogLines(char*, int) ; 


/ :4c *** :^c :4c ** 5|e :4c 3jc ** 5}c ** :4c * :4s 3jc :4c 3}c:+:5}c jjc 3je :4c 3jc jf: jjc :^c sf: :4c 5jc 5ic * 5jc jjc jf: 3jc * 5*c 5f: 5}c3jc 

Function : PseudoRajidora(long rO, long c , long M) 
Purpose : Generate a Pseudo Random Number 
AUTHOR : Khaled Horsy 

:4c 4:4: :4c :4c :4c :4c :4c :4e :4e :4; :4e :4c 4; :4c 4: :4c :4c :4e :4c :4c :4e :4c :4c :4e :4c :4c :4c :4c :4c *:t:* :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c :4c* j 

long PseudoRandoni(long rO, long a, long c, long M) 

{ 

return (a*rO + c) % M; 

} 

^****************************************************** 

Function : ConvertTo2D(long) 

Purpose : converts a Pseudo Random Number into image 
coordinates (x,y) 

AUTHOR : Khaled Horsy 

POINT ConvertTo2D(long prn) 

{ 

POINT p; 
p.X= prn % W ; 
p.Y = prn / W ; 
return p; 

} 

j ****************************************************** 
Function : NotOnlmageBoundary(POINT) 

Purpose : true if the point NOT on the image boundary 
AUTHOR : Khaled Horsy 

***************************: 4 :********** ♦♦*♦***♦*******♦/ 

int NotOnImageBoundary(POINT p) 

{ 

if((p.X > 2 && p.X < W-2) && (p.Y > 2 a& p.Y < H-2)) 
return 1 ; 
else 

return 0 ; 

} 

/♦J)c5|t******3|c*****)|t******=(:***=(:***** + **=l=**+=l'**+*=l'=l'*=t: + ***** 

Function : Significant(POINT pO) 

Purpose : returns true if pO is a significant pixel 


no 



and computes its gradient direction 
AUTHOR : Khaled Horsy Jan 97 

int significant(POINT pO ) 

{ 

double gx,gy ; 
double g; 

if(NotOnImageBoundaxy(pO)) 

{ 

ComputeGradient(pO,&gx,&gy); 
gO = SumSqrs(gx,gy); 
if (gO > THRESHOLD) 

{ phiO = atan2(gy,gx); 
return 1 ; 

> 

else return 0; 

} 

return 0 ; 

> 

Function : ComputeGradient(POINT p0,gx,gy) 

Purpose ; compute gx and gy (horizontal and vertical 
gradients) by SOBEL OPERATOR 
AUTHOR ; Khaled Horsy Jan 97 

void ComputeGradient(POINT p, double ♦gx , double *gy) 


/♦ this deals with frame B1 ♦/ 
int x,y,gxi,gyi; 

BYTE ul,l,dl,u,d,ur,r,dr; 
unsigned long p0,b = OxfaOOOOOO ;; 

x=(int) p.X; 
y=(int) p.Y; 
pO = b + y * 1024 + X ; 

/* the direction of pixels on-boaxd are different from 
SGI machine, it is left to right top to bottom 
a pixel and the one on the next line differs by 
IK */ 
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1 = *(BYTE*) (pO-1); 

/♦ 

left pixel 

♦/ 

r = *(BYTE*) (pO+1); 

/♦ 

right pixel ♦/ 

u = ♦(BYTE*) (pO-1024); 

/♦ 

upper pixel ♦/ 

ul = ♦(BYTE*)(pO-1025); 

/♦ 

upper-left 

♦/ 

ur = *(BYTE*)(p0-1023); 

/♦ 

upper-right ♦/ 

d = *(BYTE*) (pO+1024); 

/♦ 

down pixel 

♦/ 

dl = *(BYTE*)(p0+1023); 

/♦ 

down-left 

♦/ 

dr = *(BYTE*) (pO+1025); 

/♦ 

down-right 

♦/ 

gxi = -ul-l-dl+ur+r+dr; 




gyi = -dl-d-dr+ul+u+r; 




♦gx = (double) gxi ; 





*gy = (double) gyi ; 

} 


Function : SumSqrs(a,b) 

Purpose : simply compute a*a + b*b 
AUTHOR : Khaled Horsy 

double SumSqrs(double a, double b) 

{ 

return a*a + b*b ; 

} 

FUNCTION : TrackEdge(PIXEL pO , EDGE *Q ,IMG_LINE *L) 
PURPOSE : track pixel sequence in two directions 
AUTHOR : Khaled Horsy 

void TrackEdge(POINT pO , EDGE Q ,IHG_LINE *L) 

{ 

InitializeSequence(pO, &Q); 

TrackPixels(pO, PI/2, &C|) ; 

TrackPixels(pO, -PI/2, &Q); 
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ComputeSegmentData(&Q,L); 


} 


FUNCTION : InitializeSequeiice(POINT p, EDGE *Q) 

PURPOSE ; Initailize Least-Squares fitting parameters 

AUTHOR ; Khaled Morsy 

void InitializeSequence(POINT p, EDGE *0) 

{ 


C)->mOO 

Q->mlO 

C)->m01 

Q->m20 

Q->m02 

Q->mll 

Q->alpha 

q->r 

Q->ql.X 

q->ql.Y 

q->q2.X 

q->q2.Y 


1 ; 

p.X; 

p.Y; 

p.X ♦ p.X ; 
p.Y * p.Y ; 
p.X * p.Y ; 
normalize2(phi0) ; 
0.0 ; 
p.X ; 
p.Y ; 
p.X ; 
p.Y ; 


FUNCTION : normalize2() 

PURPOSE : return the normalized value of orientation 
in range -PI/2 , PI/2 
AUTHOR : Khaled Morsy 


double normalize2(double o) 

{ 

while ( o > PI/2) o = o - PI ; 
while ( o < - PI/2 ) o = o + PI ; 
return(o); 

} 


/***** 5|C *************** 5jC ******** ******* 5(C5(C5tC ******** 5|C**5|C* * 

FUNCTION : TrackPixels(POINT pO, double beta, EDGE *q) 
PURPOSE : track only one side from pO 
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AUTHOR : Khaled Horsy 

void TrackPixels(POINT pO, double beta, EDGE *Q) 

{ 

POINT pl,p2; 
double psi; 
pl=pO; 
phil=phiO; 
while(l) 

{ 

psi = 0->alpha + beta ; 

p2 = SelectNextPixelCpl,psi); 

if(p2.X == 0.0 && p2.Y == 0.0) 

{ 

if(beta == PI/2) 

■C q->ql.X = pl.X ; 
q->ql.Y = pl.Y ;} 

else 

{ q->q2.X = pl.X ; 
q->q2.Y = pl.Y ; 

} 

breaik; 

> 

LeastSquaxesFitting(q,p2); 

pl=p2 ; 


} 

} 

FUNCTION : SelectNextPixel(POINT pi, double psi) 
PURPOSE : select one pixel out of 3 neighbors 
AUTHOR : Khaled Horsy 

POINT SelectNextPixel(POINT pi,double psi) 

{ 

PIXEL pxl2; 
double phi2; 
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int num_pixels_in_group = 3; 

GROUP S; 

POINT p2; 

S = NeighborsCpl, psi); 
while(nuni_pixels_in_group > 0) 

pxl2 = LargestGradientPixel(S); 

if(pxl2.p.X != 0.0 && pxl2.p.Y != 0.0 ) 

{ phi2 = pxl2.phi; 

if(Mot0nIiaageBoundary(pxl2.p) && Consistent(psi,phi2)) 
return pxl2.p; 

} 

DeletePixel(pxl2,&S); /* make gm=0 of corresp. pixel to 

pxl2 in S */ 

num_pixels_in_group— ; 

} 

p2.X = 0.0; 
p2.Y = 0.0; 
return(p2); 


FUNCTION : Neighbors(POINT p, double psi) 

PURPOSE : find neighbors of a given point in the psi 
direction. 

AUTHOR : Khaled Horsy 

I :4c :4c :4: :4c :4c :^c5|c :4c :4c j 

GROUP Neighbors(POINT p, double psi) 

GROUP S; 

POINT ppl,pp2,pp3; 
double gx,gy,gm; 

if(psi < - PI) 
psi = -PI ; 

if(psi > -PI/8 && psi <= PI/8) /* range DO */ 

ppl.X = p.X + 1 ; ppl.Y = p.Y -1; 
pp2.X = p.X + 1 ; pp2.Y = p.Y ; 
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pp3.X = p.X + 1 ; pp3.Y = p.Y +1 ; 


else if(psi > PI/8 && psi <= 3*PI/8) /* range D1 ♦/ 

{ 

ppl.X = p.X + 1 ; ppl-Y = p.Y ; 

pp2.X = p.X + 1 ; pp2.Y = p.Y + 1 ; 

pp3.X = p.X ; pp3.Y = p.Y + 1 ; 

> 

else if(psi > 3*PI/8 && psi <= 5*PI/8) /* range D2 */ 

ppl.X = p.X + 1 ; ppl.Y = p.Y + 1 ; 
pp2.X = p.X ; Pp2.Y = p.Y + 1 ; 

pp3.X = p.X -1 ; pp3.Y = p.Y + 1 ; 

} 


else if(psi > 5+PI/8 && psi <= 7+PI/8) /* range D3 */ 

{ 

ppl.X = p.X ; ppl.Y = p.Y + 1 ; 
pp2.X = p.X-1 ; pp2.Y = p.Y + 1 ; 

pp3.X = p.X -1 ; pp3.Y = p.Y ; 

} 


else if((psi > 7*PI/8 && psi <= PI) I I 

(psi >= -PI && psi <= -7*PI/8)) /* D4 */ 

{ 

ppl.X = p.X-1 ; ppl.Y = p.Y + 1 ; 

pp2.X = p.X-1 ; Pp2.Y = p.Y ; 

pp3.X = p.X -1 ; pp3.Y = p.Y -1 ; 

} 

else if(psi > -7*PI/8 && psi <= -5*PI/8) /* range D5 */ 

{ 

ppl.X = p.X - 1 ; ppl.Y = p.Y ; 
pp2.X = p.X - 1 ; pp2.Y = p.Y -1 ; 
pp3.X = p.X ; pp3.Y = p.Y - 1 ; 

} 


else if(psi > -5*PI/8 && psi <= -3*PI/8) /* range D6 */ 

{ 

ppl.X = p.X - 1 ; ppl.Y = p.Y-1 ; 
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pp2.X = p.X ; pp2.Y = p.Y -1 ; 
pp3.X = p.X + 1 ; pp3.Y = p.Y - 1 ; 

} 

else if(psi > -3*PI/8 && psi <= -PI/8) /* range D7 */ 

{ 

ppl.X = p.X ; ppl.Y = p.Y-1 ; 

pp2.X = p.X + 1 ; pp2.Y = p.Y -1 ; 

pp3.X = p.X + 1 ; pp3.Y = p.Y ; 

} 

S.pi.p.X = ppl.X; S.pi.p.Y = ppl.Y ; 

S.p2.p.X = pp2.X; S.p2.p.Y = pp2.Y ; 

S.p3.p.X = pp3.X; S.p3.p.Y = pp3.Y ; 
ComputeGradientCppl,&gx,&gy); 
gm = SumSqrs(gx,gy); 
ifCgm > THRESHOLD) 

{S.pl.gm = gm ; 

S.pi.phi = atan2(gy,gx); 

> 

else 

{S.pl.gm = 0.0; 

S.pi.phi = 0.0; 

} 

ComputeGradient(pp2,&gx,&gy); 
gm = SumSqrs(gx,gy); 
if(gm > THRESHOLD) 

{S.p2.gm = gm ; 

S.p2.phi = atan2(gy,gx); 

> 

else 

{S.p2.gm = 0.0; 

S.p2.phi = 0.0; 

} 


ComputeGradient(pp3,&gx,&gy); 
gm = SumSqrs(gx,gy); 
if(gm > THRESHOLD) 

{S.pS.gm = gm ; 

S.p3.phi = atan2(gy,gx); 
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} 

else 

{S.p3.gm = 0.0; 

S.pS.phi = 0.0; 

} 

return S; 

} 

/ *****5f:**5(:** *****♦♦*♦♦♦♦****♦**♦*♦!(:***!(:**!(: ♦*♦♦♦***!(:*:)(!(: 

FUNCTION : LargestGradient(GROUP *3) 

PURPOSE : select one pixel with largest gradient 
from group S of pixels 
AUTHOR : Khaled Horsy 


PIXEL LargestGradientPixel(GROUP S) 

{ 

PIXEL pxl; 
double gml,gm2,gm3; 
gml = S.pl.gm ; 
gm2 = S.p2.gm ; 
gm3 = S.p3.gm ; 


if( gml > THRESHOLD &&(gml > gm2 && gml > gm3) ) 

{ 

if(gml - gm2 > (.1 * gm2)) 
return S.pl; 
else 

return S.p2 ; 

} 


else if(gm2 
return 


> THRESHOLD &&(gm2 >= gml && gm2 >= gm3)) 
S.p2; 


else if(gm3 > THRESHOLD && (gm3 > gml && gm3 

{ 


if(gm3 - gm2 > (.1 * gm2)) 
return S.p3; 
else 

return S.p2 ; 


> 


gm2)) 
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else 

{ 

pxl.p.X = 0.0 ; 
pxl.p.Y = 0.0 ; 
pxl.gm = 0.0 ; 
pxl.phi =0.0 ; 
return pxl ; 

} 


} 

FUNCTION : Consistent(double phi ,double phi) 

PURPOSE : check consistency between two directions 
AUTHOR : Khaled Horsy 

j 5)c5)c3jc:t: ************************************* ********** / 

int Consistent(double psi ,double phi) 

double epsilon = 20 ♦ PI / 180; 
if(fabs(normalize2(phi-psi-PI/2)) < epsilon ) 
return(1) ; 

else 

return(0); 

} 

FUNCTION ; Delete(PIXEL pxl, GROUP S) 

PURPOSE : make grad, magnitude of pxl zero, so it will 
not be used next time 
AUTHOR : Khaled Horsy 

void DeletePixel(PIXEL pxl , GROUP *S) 

{ 


if( pxl.p.X == S->pl.p.X && pxl.p.Y == S->pl.p.Y) 

.S->pl.gm = 0.0 ; 

else if(pxl.p.X == S->p2.p.X && pxl.p.Y == S->p2.p.Y) 
S->p2.gm =0.0 ; 

else 

S->p3.gm = 0.0 ; 
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} 


****************************5f:***********3tc*****3jc*54«** 

FUNCTION : LeastSquaxesFitting(EDGE Q, POINT p) 
PURPOSE : Compute The Least-Squares fitting for pixel 
sequence Q 

AUTHOR ; Khaled Horsy 

** ;<c *!(:!*:*♦ 5(: ♦♦♦:(: ^ jf: ♦ :)c Jf: ♦ !)c ^ * !tc * + !(:!*:**♦* / 

void LeastSquaresFitting(EDGE *Q, POINT p) 

{ 

double M20, Mil, M02; 

++Q->mOO ; 

Q->mlO += p.X; 

Q->m01 += p.Y; 

C!->m20 += p.X * p.X ; 

Q->m02 += p.Y * p.Y ; 

Q->mll += p.X * p.Y ; 

M20 = Q->m20 - (Q->mlO)*(Q->mlO)/Q->mOO ; 

Mil = Q->mll - (Q->ml0)*(Q->m01)/Q->m00 ; 

M02 = Q->m02 - (Q->m01)*(Q->m01)/Q->m00 ; 

Q->alpha = 0.5 * atcLn2(-2*Mll , M02-M20) ; 

} 

I >f: sf: 5f: 4:5f: :*:+:**:************************************* ******** 

FUNCTION. ; ComputeSegmentData(EDGE *Q, IMG.LINE ^L) 
PURPOSE ; construct Line Segment data struct. 

AUTHOR : Khaled Morsy 

/jt::^:^**^***********************************************/ 

void ComputeSegmentData(EDGE *Q ,IMG_LINE *L ) 

{ 

double delta; 

L->n = Q->m00 ; 

L->alpha = Q->alpha ; 

L->r = Q->ml0/Q->m00 * cos(Q->alpha) + 
Q->m01/Q->m00 * sin(Q->alpha) ; 
delta = Q->ql.X * cos(Q->alpha) + 

Q->ql.Y * sin(Q->alpha) - L->r ; 

L->el.X = Q->ql.X - delta * cos(Q->alpha); 
L->el.Y = Q->ql.Y - delta * sin(Q->alpha); 
delta = Q->q2.X * cos(Q->alpha) + 
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Q->q2.Y * sin(Q->alpha) - L->r ; 

L->e2.X = Q->q2.X - delta * cos(Q->alpha); 
L->e2.Y = Q->q2.Y - delta * sin(Q->alpha); 

} 

/*****>l<**>(:*****5(:********=i<*********5it****:(c:(:***J(:**Jicj(::(c*5(:** 

FUNCTION ; AddSegment(LS, L, i) 

PURPOSE : add one segment to the array 
AUTHOR : Khaled Morsy 

void AddSegment(LS, L, i) 

IMG.LINE LS[]; IMG.LINE L;int i; 


{ 

LS[i].n = L.n; 

LS[i].alpha = L.alpha; 

LS[i].r = L.r ; 

LS[i].el = L.el ; 

LS[i].e2 = L.e2 ; 

> 

FUNCTION : NotClose(p,L,index) 
PURPOSE : Closeness Test 
AUTHOR : Khaled Morsy 

int NotClose(p,L,index) 

POINT p ; IMG.LINE L[] ; int index ; 


double dist, line_alpha ; 
int i=0; 

double px = p.X ; 
double py = p.Y ; 

double Ixl,lyl,lx2,ly2,ltheta, dx,dy; 
int temp =0 ; 

double xstar , ystar , xestar; 
double xtemp,ytemp , A,B; 
whiled < index) 

1x1= L[i].el.X ; lyl=L[i] .el.Y ; 
lx2=L[i].e2.X ; ly2 = L[i3.e2.Y ; 
if (lxl<=lx2 && lyl <= ly2) 
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{ 

dx = lx2-lxl; 
dy = ly2 - lyl; 

} 

if(1x2 <= 1x1 && ly2 <= lyl) 

{ 

dx = 1x1 - 1x2 ; 
dy = lyl - ly2 ; 
xtemp = 1x1 ; 
ytemp = lyl ; 

1x1 = 1x2; 

lyl = Iy2 ; 

1x2 = xtemp ; 
ly2 = ytemp ; 

} 

if (1x2 <= 1x1 && lyl <= ly2) 

{ 

dx = lx2-lxl ; 
dy = ly2-lyl; 

} 

if (1x1 <= 1x2 && ly2 <= lyl) 

dx = 1x1 - 1x2 ; 
dy = lyl - ly2 ; 
xtemp = 1x1 ; 

ytemp = lyl ; 

1x1 = 1x2 ; 
lyl = ly2 ; 

1x2 = xtemp ; 

ly2 = ytemp ; 

} 

Itheta = atan2(dy,dx) ; 

A = cos(Itheta) ; 

B = sin(ltheta) ; 

xstar = (px-lxl)* A + (py-lyl) * B; 
ystar = -(px-lxl)* B + (py-lyl) * A ; 

xestar = (lx2-lxl)* A + (ly2-lyl) * B; 

if(xstar < 0 ) 
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dist = sqrt((px-lxl) * (px-lxl) + (py-lyl) *(py-lyl)) 
else {if(xstar >= 0 && xstar <= xestar) 
dist = fabs(ystar) ; 

else 

dist = sqrt((px-lx2) * (px-lx2) + (py-ly2) ♦(py-ly2)) ; 


} 

if (fabs(dist) < 7) /* use 7 for delta */ 

return 0 ; 
else 

i++ ; 

} 

return 1 ; 

} 


FUNCTION :GetNumLines() 

int GetNumLinesO 

{ 

int kk ; 

printf("\n ENTER NUM. OF LINES 
scant("%d",&kk); 
return kk ; 

> 

FUNCTION : normalize() 

PURPOSE : return the normalized value of orientation 

double normalize(double o) 
double d = o; 

d= o - 2 * PI *(ceil((o+PI)/(2*PI))-1.0); 
return(d); 


} 
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int sum(int a, int b) 

{ 

return a+b ; 

} 

/ 5jc*5t:****5f:5jc*if:5lc*5ic*:4c*4:***************************************/ 

/* FUNCTION : detectEdgesO 
/♦ AUTHOR : KHALED HORSY 

/* PURPOSE : main function for detecting edges by tracking and 
/* random hitting 

void detectEdgesO 

{ 

IMG.LINE L, LS[20]; 

EDGE Q; 
int yy, hh ; 
int i=0; 
long rO,c; 
int a =1; 

int j = 227; /* best for 732 by 476 image size */ 

int k = 317; 

int num.hits = 0; 

long M,prn; 

POINT pO; 
double p0x,p0y; 
int K; 
prn=0; 
c=j*732+k; 

M=732*476; 

K= GetNumLinesO ; 
while(i<K) 

{ 

prn = PseudoRandom(pm,a,c,M) ; 

pO = ConvertTo2D(prn) ; 

if(significant(pO)) 

{ 

if(NotClose(pO,LS,i)) 
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{ 


TrackEdge(pO,Q,&L); 

if (L.n > 50) 

{ 


AddSegment(LS,L,i); 
printf ("\nline # ‘/.d :%f Xf %f "hf", 

i, LSCi].el.X , LS[i].el.Y, LS[i].e2.X, 
LSCi].e2.Y) ; 

i++ ; 

> 

} 

} 

} 

InitLogLines(NULL,0) ; 

LogLines(LS,K); 


> 
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4. IMAGE UNDERSTANDING LIBRARY FUNCTIONS 

******♦♦****** + ***************♦************♦***♦*****♦* 5|c jf: ***:+: 

FILE NAME : imgLib.h 
By : KHALED MORSY 

Last Update: March 17 

Purpose : set of functions required by Yamabico lU system 
collected from low-level functions (John Kisor) 
to provide easier user-level interface with the system 
** ******************* ***************************** / 

YAMAIMAGE* blImage; 

!* Function : copyImageToMemory(IMSPage frameNumber) 

Purpose ; Copies an image frame to 2-d array for edge detection 
and further lU algorithms. 

Authors : John Kisor, Khaled Morsy and L. Remias 

3*c5(c ******** ******** ***********************************************! 

void copylmageToMemory(IMSPage frameNumber) 

{ 

unsigned long pagelindex = OxfaOOOOOO; /* First empty line for pg2 pixels */ 
unsigned long page2Index = 0xfa07a400; /* First IK of pixels on page 2*/ 
unsigned long pagelSource = OxfaOOOOOO; 
unsigned long page2Source = 0xfa07a400; 


int i, j ; 

int xSize = 732; 
int ySize = 476; 

bllmage = (YAMAIMAGE*)malloc(sizeof(YAMAIMAGE)) ; 

blImage->xSize = xSize; 
blImage->ySize = ySize; 

for(i=0; i<476; i+=2) 

{ 

for (j=0; j<732; j++) /*732 bytes 

{ 


126 


bllmage->image[j][i] = *(BYTE*)pagelSource; 
bllmage->image[j][i+l]= *(BYTE*)page2Source; 


pagelSource++; 

page2Source++; 

if(j==l&& i==0) /♦ for test only */ 

printf("\n ‘/,d" , bllmage->image[j] [i] ) ; 


} 

pagelindex += 0x400; /* IK inc-new row */ 

pagelSource = pagelindex; 
page2Index += 0x400; 
page2Source = page2Index; 


> 


> 

/ ♦**♦*♦******♦♦***♦♦♦♦*****♦♦♦*****♦**♦*♦***♦*♦*♦♦%♦♦%***♦******♦♦*/ 

/* Function ; initlmgSys(void) ♦/ 

/* Purpose : initalize image grabbing system ♦/ 

/♦ Author : Khaled Morsy March 97 ♦/ 

void initlmgSys(void) 

DisableInterruptsO ; 
setInputPath(Al, CONSTANT); 
setInputPath(Bl, CONSTANT); 
setInputPath(A0, CONSTANT); 
setFirstKToConstant(A1, 0x0000); 
setFirstKToConstant(B1, 0x0000); 
setFirstKToConstant(B1, 0x0000); 


} 

I* Function : setCameraO */ 
/* Purpose : set input path to Camera for frames A0,A1,B1 */ 
/* Author : Khaled Morsy */ 
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/***♦*!»:***!('***************♦******************* **!(:*=t:********!(:^******/ 

void setCamera(void) 

{ 

setInputPath(Al, CAMERA); 
setInputPath(Bl, CAMERA); 
setInputPath(AO, CAMERA); 

} 

/********♦** *****************************************!(:*************/ 
/* Function : startGrabO */ 

/* Purpose : start grabbing cin image to frames A0,A1,B1 */ 

/* Author : Khaled Morsy March 97 */ 

void startGrab(void) 

■c 

setFrameAcquire(AO, GRAB); 
setFrameAcquire(Al, GRAB); 
setFrameAcquire(Bl, GRAB); 

} 

/* Function : stopGrabO */ 

/* Purpose : stop grabbing an image to frames A0,A1,B1 */ 

/* Author : Khaled Morsy March 97 */ 

void stopGrab(void) 

{ 

setFrameAcquire(AO, FREEZE); 
setFrameAcquire(Al, FREEZE); 
setFrameAcquire(B1, FREEZE); 

> 

/* Function : snapO */ 

/* Purpose : snap an image to frames A0,A1,B1 */ 

/* Author : Khaled Morsy March 97 (from J. Kisor) */ 

void snap(void) 

{ 

setFrameAcquire(AO, SNAP); 


128 





setFrameAcquire(Al, SNAP); 
setFrameAcquire(Bl, SNAP); 


> 




/* Function : clearFramesO 

/* Purpose : clear pixel data in A0,A1,B1 

/* Author : Khaled Morsy March 97 (from J. Kisor) 


*/ 

*/ 

*/ 


/j(::f: * :f::(!sf:sf: ♦**=(:* ^ =(:****♦♦*♦♦*=(:=(:♦*=(:*******♦* St:* ****!(::♦:♦*:+:♦* J): * s(: ♦♦****♦****♦ / 

void clearFraraes(void) 


{ 


setInputPath(Al, CONSTANT); 
setInputPathCBl, CONSTANT); 
setInputPath(AO, CONSTANT); 


} 



/* Function : interlacePage2(void) 

/* Purpose : Copies the page 2 part of an interlaced image 

between the lines of page one of the interlaced image to make 

a truly interlaced picture 

Author : John Kisor 

Comments : Khaled Morsy March 97 

void interlacePage2(void) 

{ 

int i, j; 

unsigned long pagelindex = OxfaOOOOOO; /* First empty line for pg2 pixels */ 
unsigned long page2Index = 0xfa07a400; /* First IK of pixels on page 2*/ 
unsigned long page1Destination = OxfaOOOSOO; 
unsigned long page2Source = 0xfa07a400; 

selectPage(Bl); 

for (i=0; i<236; i++) 

for (j=0; j<366; j++) { 1*732 bytes but 2 bytes per access */ 
*(WORD*)pagelDestination = *(W0RD*)page2Source; 
pagelDestination += 2; 
page2Source += 2; 


> 

pagelindex += 0x800; 
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page2Index += 0x800; 
pagelDestination = pagelindex; 
page2Source = page2Index; 


} 


} 

Function : readimage 
Author : Khaled Morsy 

****************************************************************/ 

void readimage(void) 

{ 

setCameraO ; 
startGrabO ; 
acqEnableO ; 

} 
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APPENDIX B. WORLD MODEL 


deck2.h 
Khaled Morsy 

’20 description + height of the 2nd floor points. 

Major change from the 3D model of 5th floor by J. Stein. 
Only, z-value added to a vertx. All other 
functions are deleted. 

WORLD *make_world() 

{ 

WORLD *W; 

POLYGON *Pl,*Last_p; 


/* FILE: 

By : 
Motes: 
ChctngGs: 


VERTEX *P1V1, *P1V2, ♦P1V3, *P1V4, *P1V5, *P1V6, *P1V7, 

*P1V8,*P1V9,*P1V10, *P1V11, *P1V12, *P1V13, *P1V14, 
♦P1V15,*P1V16,*P1V17, *P1V18, *P1V19, *P1V20, *P1V21, 
♦P1V22,*P1V23,*P1V24, *P1V25, ♦PmS, *P1V27, *P1V28, 
*P1V29,*P1V30,*P1V31, *P1V32, *P1V33, *P1V34, *P1V35, 
=t'PlV36,*PlV37,*PlV38, *P1V39, *P1V40, *P1V41, *P1V42, 
♦P1V43,*P1V44,*P1V45, *P1V46, *P1V47, *P1V48, *P1V49, 
*P1V50,*P1V51,*P1V52, ♦P1V53, *P1V54, *P1V55, *P1V56, 
*P1V57,*P1V58,*P1V59, ♦P1V60, *P1V61, *P1V62, *P1V63, 
*P1V64,*P1V65,*P1V66, *P1V67, *P1V68, *P1V69, *P1V70, 
♦P1V71,*P1V72,*P1V73, *P1V74, +P1V75, ♦P1V76, *P1V77, 
*P1V78,*P1V79, *P1V80, +P1V81, ♦P1V82,*P1V83, *P1V84, 
*P1V85,*P1V86, *P1V87, *P1V88, *P1V89, +P1V90, 
*P1V91,*P1V92,*P1V93, *P1V94, *P1V95, *P1V96, *P1V97, 
*P1V98,*P1V99, ♦PIVIOO, *P1V101, *P1V102, *P1V103, 
*P1V104,*P1V105,*P1V106, ♦P1V107, *P1V108, *P1V109,*P1V110, 
*P1V111,*P1V112,*P1V113, *P1V114, *P1V115, *P1V116, 
*P1V117,*P1V118,*P1V119,*P1V120, *P1V121, *P1V122,*P1V123, 
*P1V124,*P1V125, 

*P1V126,*P1V127,*P1V128, *P1V129, *P1V130, *P1V131, 
*P1V132,*P1V133,*P1V134, *P1V135, ♦P1V136, ♦P1V137, 

♦P1V138; 


W=add_world("2nd_floor",9); 
Hl=add_ph("front.hall",10,W,1,0); 
Pl=add_pg(W,1); /* CW Polygon */ 
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PlVl 

P1V2 

P1V3 

P1V4 

P1V5 

P1V6 

P1V7 

P1V8 

P1V9 

PIVIO 

PlVll 

P1V12 

P1V13 

P1V14 

P1V15 

P1V16 

P1V17 

P1V18 

P1V19 

P1V20 

P1V21 

P1V22 

P1V23 

P1V24 

P1V25 

P1V26 

P1V27 

P1V28 

P1V29 

P1V30 

P1V31 

P1V32 

P1V33 

P1V34 

P1V35 


add_vertex(Pl,0.0,0.0,257.5) ; 
add_vertex(PI,0.0,248.5,257.5) ; 

add.vertex(PI,164.7,248.5,212.5); 
add_vertex(Pl,164.7,256.75,212.5); 
add_vertex(Pl,245.7,256.75,212.5) ; 
add_vertex(Pl,245.7,248.5,212.5); 

add_vertex(Pl,390.2,248.5,212.5); 
add_vertex(Pl,390.2,256.75,212.5); 
add_vertex(Pl,491.2,256.75,212.5); 
add_vertex(Pl,491.2,248.5,212.5); 

add.vertex(PI,657.8,258.5,283.5); 
add.vertex(PI,657.8,845.5,283.5); 
add_vertex(PI,893.5,845.5,283.5); 

add_vertex(Pl,893.5,771.9,212.5); 
add_vertex(Pl,956.5,771.9,212.5); 
add_vertex(Pl,956.5,818.9,212.5); 
add.vertex(PI,1096.68,818.9,212.5); 
add_vertex(Pl,1096.68,588.9,212.5); 
add.vertex(PI,956.5,588.9,212.5); 
add.vertex(PI,956.5,635.9,212.5); 
add.vertex(PI,893.5,635.9,212.5); 

add.vert ex(P1,893.5,477.4,212.5); 
add.vertex(Pl,956.5,477.4,212.5) ; 
add.vertex(PI,956.5,524.7,212.5); 
add.vertex(Pl,1096.68,524.7,212.5); 
add.vertex(Pl,1096.68,294.7,212.5); 
add.vertex(PI,956.5,294.7,212.5); 
add.vertex(PI,956.5,341.7,212.5); 
add.vertex(PI,893.5,341.7,212.5); 
add.vertex(PI,893.5,248.5,212.5); 

add.vertex(PI,1190.5,248.5,212.5) ; 
add.vertex(PI,1190.5,256.75,212.5); 
add.vertex(Pl,1281.5,256.75,212.5); 
add.vertex(Pl,1281.5,248.5,212.5); 

add.vertex(Pl,1592.5,248.5,212.5) ; 


/* beside graphics lab */ 
/*rm 253A */ 

/* EXIT DOOR */ 


/* elev 1 */ 


/* elev 2 */ 


/* end of elevators region (comer! 
/* room 261 */ 

/* room 259 */ 
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P1V36 

P1V37 

P1V38 

P1V39 

P1V40 

P1V41 

P1V42 

P1V43 

P1V44 

P1V45 

P1V46 

P1V47 

P1V48 

P1V49 

P1V50 

P1V51 

P1V52 

P1V53 

P1V54 

P1V55 

P1V56 

P1V57 

P1V58 

P1V59 

P1V60 

P1V61 

P1V62 

P1V63 

P1V64 

P1V65 

P1V66 

P1V67 

P1V68 


add_vertex(Pl,1592 
add_vertex(Pl,1683 
add_vertex(Pl,1683 

add_vertex(Pl,2159 
add_vertex(Pl,2159 
add_vertex(Pl,2250 
add_vertex(Pl,2250 

add_vertex(Pl,2587 
add_vertex(Pl,2587 
add_vertex(Pl,2678 
add_vertex(PI,2678 

add_vertex(Pl,3692 

add_vertex(Pl,3692 
add_vertex(PI,3683 
add_vertex(Pl,3683 
add_vertex(Pl,3692 

add_vertex(Pl,3692 
add_vertex(Pl,3711 
add_vertex(Pl,3711 
add_vertex(Pl,3962 

add_vertex(Pl,3962 
add_vertex(Pl,3970 
add_vertex(Pl,3970 
add_vertex(Pl,3962 

add_vertex(Pl,3962 

add_vertex(Pl,5269 
add_vertex(PI,5269 
add_vertex(Pl,5360 
add_vertex(Pl,5360 

add_vertex(Pl,5680 
add_vertex(Pl,5680 
add_vertex(Pl,5819 
add_vertex(Pl,5819 


.5,256.75,212.5); 
.9,256.75,212.5); 

9.248.5.212.5) ; 

.3,248.5,212.5) ; 

3.256.75.212.5) ; 

3.256.75.212.5) ; 
3, 248.5,212.5); 

3.248.5.212.5) ; 

3.256.75.212.5) ; 

3.256.75.212.5) ; 

3.248.5.212.5) ; 

0, 248.5,212.5); 

0,457.0,212.5); 
75,457.0,212.5); 
75,548.0,212.5); 
0,548.0,212.5); 

.0,212.5); 
.6,727.3,283.5); 

6.809.7.283.5) ; 

2.809.7.283.5) ; 

2.212.5) ; 

45.358.9.212.5) ; 

45.267.9.212.5) ; 

2.267.9.212.5) ; 

2.248.5) ; 

.7,248.5,212.5); 

7.256.75.212.5) ; 
.7,256.75,212.5); 
.7,248.5,212.5); 

.7,248.5,212.5); 
.7,375.0,212.5); 
.2,375.0,212.5); 
.2,248.5,212.5); 


/* room 257 */ 

/* room 243 *! 

/* corner */ 

/* room 241 */ 

/* corners */ 

/* room 239 */ 

/♦ comer */ 

/♦ 235 */ 

/* front of 231 */ 
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P1V69 

P1V70 

P1V71 ^ 
P1V72 ^ 
P1V73 ^ 
P1V74 ^ 

P1V75 ^ 
P1V76 ^ 
P1V77 ^ 
P1V78 ^ 

P1V79 ^ 
P1V80 ^ 
P1V81 ^ 
P1V82 ^ 

P1V83 ^ 
P1V84 ^ 
P1V85 ^ 
P1V86= 

P1V87 ^ 
P1V88 ^ 
P1V89 ^ 
P1V90 : 

P1V91 ^ 
P1V92 ^ 
P1V93 ^ 
P1V94 ^ 

P1V95 ^ 
P1V96 ^ 
P1V97 ^ 
P1V98 ^ 

P1V99 

PIVIOO 

PIVIOI 


■ add_vertex(Pl,6239 
: add_vertex(Pl,6239 

add_vertex(Pl,5932 
add_vertex(Pl,5932 
add_vertex(Pl,5841 
add_vertex(Pl,5841 

add_vertex(PI,5364 
add_vertex(Pl,5364 
add_vertex(PI,5273 
add_vertex(Pl,5273 

add_vertex(Pl,4976 
add_vertex(Pl,4976 
add_vertex(Pl,4884 
add_vertex(Pl,4884 

add_vertex(Pl,4796 
add_vertex(Pl,4796 
add_vertex(Pl,4715 
add_vertex(Pl ,4715.' 


.2,248.5,257.5) ; 
.2,0.0,257.5); 

7,0.0,212.5); 
7,-8.25,212.5); 
7,-8.25,212.5); 
7,0.0,212.5); 

2,0.0,212.5); 
2,-8.25,212.5); 
2,-8.25,212.5); 
2,0.0,212.5); 

0,0.0,212.5); 
0,-8.25,212.5); 
2,-8.25,212.5); 
2,0.0,212.5); 

2,0.0,212.5); 
2,-8.25,212.5); 
7,-8.25,212.5); 
■ ,0.0,212.5); 


/* end of hallway */ 
/* end of hallway =*=/ 

I* rm 228 */ 

I* rm 226 ♦/ 

/* rm 230 */ 

/* 232 ♦/ 


add_vertex(PI,4534.0, 0.0,212.5); /♦ around water cooler */ 

add_vertex(Pl,4534.0, -48.0,212.5); 
add_vertex(Pl,4447.9, -48.0,212.5); 
add_vertex(Pl,4447.9, 0.0,212.5); 

add_vertex(P 1,4367.9, 0.0,212.5); /=*= 234 */ 

add_vertex(Pl,4367.9, -8.25,212.5); 
add_vertex(PI,4287.4 ,-8.25,212.5); 
add_vertex(Pl,4287.4, 0.0,212.5); 


add_vertex(P1,4228.4, 0.0,212.5); h EXIT B*/ 
add_vertex(Pl,4228.4, -8.25,212.5); 
add_vertex(Pl,4046.4, -8.25,212.5); 
add_vertex(Pl,4046.4, 0.0,212.5); 

= add_vertex(Pl,3999.8, 0.0,212.5); /* 236 ♦/ 

= add_vertex(Pl,3999.8, -8.25,212.5); 

= add_vertex(Pl,3919.3, -8.25,212.5); 
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P1V102 

P1V103 

P1V104 

P1V105 

P1V106 

P1V107 

P1V108 

P1V109 

PlVllO 

PlVlll 

P1V112 

P1V113 

P1V114 

P1V115 

P1V116 

P1V117 

P1V118 

P1V119 

P1V120 

P1V121 

P1V122 

P1V123 

P1V124 

P1V125 

P1V126 

P1V127 

P1V128 

P1V129 

P1V130 

P1V131 

P1V132 

P1V133 

P1V134 


add_vertex(Pl,3919.3, 0.0,212.5); 


add_vertex(Pl,3351 
add_vertex(Pl,3351 
add_vertex(Pl,3260 
add_vertex(Pl,3260 

add_vertex(Pl,3018 
add_vertex(Pl,3018 
add_vertex(Pl,2927 
add_vertex(Pl,2927 

add_vertex(Pl,2123 
add_vertex(Pl,2123 
add_vertex(Pl,2032 
add_vertex(Pl,2032 

add_vertex(Pl,1971 
add_vertex(Pl,1971 
add_vertex(Pl,1880 
add_vertex(Pl,1880 

add_vertex(Pl,1553 
add_vertex(Pl,1553 
add_vertex(Pl,1462 
add_vertex(Pl,1462 

add_vertex(Pl,1311 
add_vertex(Pl,1311 
add_vertex(Pl,1220 
add_vertex(Pl,1220 


.3, 

0.0,212.5); 

/* 

.3, 

-18.25,212.5) 

} 

.3, 

-18.25,212.5) 

f 

.3, 

0.0 ,212.5); 


.8, 

0.0,212.5); 

/* 

.8, 

-8.5,212.5); 


.0, 

-8.5,212.5); 


.0, 

0.0,212.5); 


.0, 

0.0,212.5); 

/* 

.0, 

-8.5,212.5); 


,-8 

.5,212.5); 


,0. 

0,212.5); 


•4, 

0.0,212.5); 

!* 

•4, 

-8.25,212.5); 


.4, 

-8.25,212.5); 


•4, 

0.0,212.5); 


.2, 

0.0,212.5); 

/* 

.2, 

-8.25,212.5); 


.2, 

-8.25,212.5); 


.2, 

0.0,212.5); 


.9, 

0.0,212.5); 

/* 

.9, 

-8.25,212.5); 


.9, 

-8.25,212.5); 


.9, 

0,212.5); 



add_vertex(Pl,834.4,0.0,212.5) ; /♦ 

add_vertex(Pl,834.4,-8.25,212.5); 
add_vertex(Pl,743.4,-8.25,212.5); 
add_vertex(Pl,743.4,0.0,212.5); 


add_vertex(Pl,417.1,0.0,212.5); /* 

add_vertex(Pl,4l7.1,-8.25,212.5); 
add_vertex(Pl,326.1,-8.25,212.5); 
add_vertex(Pl,326.1,0.0,212.5); 


rm 238 */ 


240 maze room*/ 


242 yamabico lab */ 


rm 244 ♦/ 


rm 246 */ 


rm 248 ♦/ 


rm 250 ♦/ 


rm 252 */ 
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P1V135 = 
P1V136 = 
P1V137 = 
P1V138 = 
/*** end 


add_vertex(Pl,265.9,0.0,212.5); /* rm 254 */ 

add_vertex(Pl,265.9,-8.25,212.5); 
add_vertex(Pl,174.9,-8.25,212.5); 
add_vertex(Pl,174.9,0.0,212.5); 
of floor points */ 
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